Cytochrome P450 oxygenases

ABSTRACT

Nucleic acids encoding cytochrome P450 variants are provided. The cytochrome P450 variants of have a higher alkane-oxidation capability, alkene-oxidation capability, and/or a higher organic-solvent resistance than the corresponding wild-type or parent cytochrome P450 enzyme. A preferred wild-type cytochrome P450 is cytochrome P450 BM-3. Preferred cytochrome P450 variants include those having an improved capability to hydroxylate alkanes and epoxidate alkenes comprising less than 8 carbons, and have amino acid substitutions corresponding to V78A, H236Q, and E252G of cytochrome P450 BM-3. Preferred cytochrome P450 variants also include those having an improved hydroxylation activity in solutions comprising co-solvents such as DMSO and THF, and have amino acid substitutions corresponding to T235A, R471A, E494K, and S1024E of cytochrome P450 BM-3.

[0001] This application claims priority from U.S. Provisional Application Serial No. 60/306,766, filed Jul. 20, 2001, and U.S. Provisional Application Serial No. 60/308,429, filed Jul. 27, 2001. Each of these prior applications is hereby incorporated by reference in its entirety.

[0002] The Government has certain rights to this invention pursuant to Grant Nos. DBI-9807460 (ETF) and BES-9981770 (FHA) awarded by the National Science Foundation.

FIELD OF THE INVENTION

[0003] The invention relates to novel uses for cytochrome P450 BM-3. In addition, the invention relates to novel variants of cytochrome P450 enzymes which are more resistant to organic solvents and/or are capable of oxidizing alkanes or alkenes.

BACKGROUND

[0004] The term paraffin (from Latin parum affinis=slight affinity) accurately reflects the nature of alkanes: these compounds are notoriously inert, and activating their C—H bonds presents a difficult chemical obstacle. In fact, one of the great challenges of contemporary catalysis is the controlled oxidation of hydrocarbons (Shilov 1997). Processes for controlled, stereo- and regioselective oxidation of hydrocarbon feed stocks to more valuable and useful products such as alcohols, ketones, acids, and peroxides would have a major impact on the chemical and pharmaceutical industries. However, selective oxyfunctionalization of hydrocarbons thus remains one of the great challenges for contemporary chemistry. Despite decades of effort, including recent advances (Chen et al., 2000; Hartman and Ernst, 2000; Thomas et al., 2001), chemical catalysts for alkane functionalization are characterized by low yields, poor selectivity and harsh conditions.

[0005] Biocatalysts (enzymes) that oxidize alkanes allow organisms to utilize hydrocarbons as a source of energy and cellular building blocks (Ashraf et al., 1994; Watkinson and Morgan, 1990). Enzymes have unique properties that distinguish them from most chemical catalysts. Most impressive is their ability to catalyze specific, and often difficult, chemical reactions in water at room temperature and atmospheric pressure. Forty years of screening alkane-assimilating organisms (Leadbetter and Foster, 1959) have identified a variety of multi-subunit, membrane-associated enzyme complexes, which have inspired curiousity and mimicry for their ability to catalyze selective oxidations at room temperature and ambient pressure (Scheller et al., 1996; Stevenson et al., 1996; Fox et al., 1990; Fisher et al., 1998; Benson et al., 1979). However, low catalyst turnover rates and limited stability make applications of biocatalytic C—H bond activation feasible only in a very few industrial processes where high value compounds are produced (Schmid et al., 2001).

[0006] Monooxygenases have unique properties that distinguish them from most chemical catalysts. Most impressive is their ability to catalyze the specific hydroxylation of non-activated C—H; one of the most useful biotransformation reactions, which is often difficult to achieve by chemical means, especially in water, at room temperature under atmospheric pressure. However, for chemical synthesis, organic solvents, not aqueous solutions, are generally used. The use of organic solvents has many advantages, most importantly are a) higher solubility of often in aqueous solutions poor soluble nonpolar compounds; b) suppression of water-dependent side reactions; c) alteration in enantioselectivity; and d) elimination of microbial contaminations (Dordick, 1992). The main drawback of enzymes functioning in organic solvents is their drastically reduced catalytic activity caused by dehydration of the enzyme(Klibanov, 1997). Little is known about this process and mainly hydrolytic enzymes such as esterases and lipases were used to study and improve their activity and stability in organic solvents (Kvittingen et al., 1992). Cofactor dependent oxidative enzymes have multiple domains and highly regulated electron transfer mechanisms to transport the reduction equivalent to the catalytic heme center (Munro et al., 1996; Beratan, 1996; Moser et al., 1995). Organic solvents can interfere by affecting redox potentials and interactions between single domains. However, no theory has been developed to explain the influence of organic solvents toward complex oxidative enzymes. Thus, the low organic solvent resistance of enzymes, in particular enzymes suitable for oxidation of hydrophobic substances, is a particularly challenging problem.

Cytochrome P450 Monoxyzenases

[0007] The cytochrome P450 monooxygenases (“P450s”) is a group of widely-distributed heme enzymes that inserts one oxygen atom of O₂ into a diverse range of hydrophobic substrates, often with high regio- and stereoselectivity. The second oxygen atom is reduced to H2O. The active sites of all cytochrome P450s contain an iron protoporphryin IX with cysteinate as the fifth ligand; the final coordination site is left to bind and activate molecular oxygen (Groves et al., 1995). For many of the P450-catalyzed reactions, no chemical catalysts come close in performance (Lewis, 1996). These enzymes, however, are often only poorly active towards nonnatural substrates and cannot tolerate normal process conditions, including organic solvents (Lewis, 1996; Kuhn-Velten, 1997). Simply put, they are a process engineering nightmare.

[0008] One particular P450 enzyme, cytochrome P450 BM-3 from Bacillus megaterium (EC 1.14.14.1) also known as CYP102, is a water-soluble, catalytically self-sufficient P450 containing a monooxygenase domain (64 kD) and a reductase domain (54 kD) in a single polypeptide chain (Narhi and Fulco, 1986 and 1987; Miura and Fulco, 1975; Ruettinger et al., 1989). The minimum requirements for activity are substrate, dioxygen and the cofactor nicotinamide adenine dinucleotide phosphate (NADPH). Nucleotide and amino acid sequences for P450 BM-3 can be found in, and are hereby incorporated by reference from, the GenBank database under the accession Nos. J04832 (SEQ ID NO: 1) and P14779 (SEQ ID NO: 2), respectively.

[0009] P450 BM-3 hydroxylates fatty acids of chain length between C12 and C18 at subterminal positions, and the regioselectivity of oxygen insertion depends on the chain length (Miura and Fulco, 1975; Boddupalli et al., 1990). The optimal chain length of saturated fatty acids for P450 BM-3 is 14-16 carbons, and the enzyme was initially believed to have no activity towards fatty acids smaller than C12 (Miura and Fulco, 1975). P450 BM-3 is also known to hydroxylate the corresponding fatty acid amides and alcohols and forms epoxides from unsaturated fatty acids (Miura and Fulco, 1975; Capdevila et al., 1996; Graham-Lorence et al., 1997; Ruettinger and Fulco, 1981). The enzyme was reported to be inactive towards alkanes and methyl esters lacking the polar functionality of the natural substrates (Miura and Fulco, 1975). However, there were indications that P450 BM-3 could accept shorter-chain alkanes, although with very low activity (Munro et al., 1993).

[0010] Powerful techniques for creating enzymes with modified or improved properties are now available, such as directed evolution (Arnold, 1998), in which iterative cycles of random mutagenesis, recombination and functional screening for improved enzymes accumulate the mutations that confer the desired properties. Mutant P450 BM-3 enzymes with modified activity have now been reported in the literature. For example, an F87A (Phe87→Ala87) mutant was found to display a higher activity for the 12-pCNA substrate (Farinas et al., 2001; Schwaneberg et al., 1999). In addition, by reengineering the fatty acid binding site to accommodate fatty acids with a chain-length shorter than 12 (Li et al., 2001; Ost et al., 2000), Li and coworkers (2001) found mutants that are capable to hydroxylate indole which dimerizes in the presence of oxygen to indigo and indinlbin, and Carmichael and Wong (2001) found P450 BM-3 mutants that could oxidize polycyclic aromatic hydrocarbons (“PAHs”) such as phenanthrene, fluoranthene, and pyrene. In addition, the Schmid group recently reported mutants of P450 BM-3 that can hydroxylate a variety of nonnatural substrates, including octane, several aromatic compounds and heterocyclic compounds (Appel et al., 2001). In addition, P450 BM-3 mutants for epoxidation of substrates such as long-chain unsaturated fatty acids (Miura and Fulco, 1975; Capdevila et al., 1996; Graham-Lorence et al., 1997; Ruettinger and Fulco, 1981) and styrene (Fruetel, J A et al., 1994) have been suggested.

[0011] Many of the wild-type and mutant P450 BM-3 substrates have, if the performance of the catalyst is sufficient, bright prospective applications as products or intermediates in fine chemical synthesis (Schneider et al., 1999). Unfortunately, until now, the alkane hydroxylation activities of P450 BM-3 mutants are still limited, especially for lower alkanes. Moreover, many of P450 BM-3 substrates (fatty acids, alcohols, amides, C>12 alkanes, polyaromatic hydrocarbons, heterocycles, etc.), are notoriously insoluble in aqueous solution and require for solubilization an organic co-solvent, and the organic solvent resistance of P450 BM-3 in water miscible co-solvents is low and insufficient for industrial applications. To date, no P450 BM-3 mutants with improved or altered solvent resistance have been identified.

[0012] Thus, there is a need in the art for industrially useful oxidation catalysts for alkane hydroxylation and alkene epoxidation, particularly on hydrocarbon substrates that are shorter than its preferred substrates, the fatty acids. While various alkane hydroxylases are known, for example w-hydroxylase and methane monooxygenase, none of these naturally-occurring enzymes have the practical advantages of an enzyme such as P450 BM-3, which is highly expressed in recombinant form in bacteria and contains all its functional domains in a single polypeptide chain. There is also a need for oxygenase enzymes that can operate efficiently in organic co-solvents. This invention addresses these and other needs in the art.

SUMMARY OF THE INVENTION

[0013] The present invention is based on the discovery of P450 BM-3 variants that have improved alkane oxidation activity, alkene oxidation activity, and/or improved stability in organic co-solvents.

[0014] Accordingly, the invention provides an isolated nucleic acid encoding a cytochrome P450 variant which has a higher capability than the corresponding wild-type cytochrome P450 to oxidize at least one substrate selected from an alkane comprising a carbon-chain of no more than 8 carbons and an alkene comprising a carbon-chain of no more than 8 carbons, wherein the wild-type cytochrome P450 comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO: 2 (i.e., the wild-type sequence of P450 BM-3), and the cytochrome P450 variant comprises an amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from V78, H236, and E252. In one embodiment, the amino acid sequence has at least 80% sequence identity to SEQ ID NO: 2. Preferably, the amino acid sequence is SEQ ID NO: 2. In another embodiment, the higher capability is a higher maximum turnover rate of the substrate into an oxidized product, and the maximum turnover rate of the variant is at least 5 times, preferably at least 10 times, the maximum turnover rate of the wild-type. When the substrate is an alkane, the capability to oxidize is preferably the capability to hydroxylate, and the alkane can be selected from one or more of octane, hexane, cyclohexane, pentane, butane, and propane. When the substrate is an alkene, the capability to oxidize can be either or both of the capability to epoxidate or hydroxylate, and the alkene can be selected from one or more of 1-hexene, 2-hexene, 3-hexene, cyclohexene, isoprene, allyl chloride, propene, and styrene.

[0015] In another embodiment, the cytochrome P450 variant comprises amino acid substitutions at residues corresponding to at least two residues of SEQ ID NO: 2 selected from V78, H236, and E252, preferably at all of these residues, and, even more preferably, the amino acid substitution is selected from V78A, H236Q, and E252G. When the wild-type sequence is that of P450 BM-3, the cytochrome P450 variant can comprise amino acid substitutions at two residues selected from V78, H236, and E252, preferably all of these residues, and, even more preferably, the amino acid substitutions are V78A, H236Q, and E252G. In addition, the cytochrome P450 variant may comprise at least one further amino acid substitution at a residue selected from H138, T175, V178, A184, N186, F205, D217, S226, R255, A290, A295, L353, and G396. Optionally, the amino acid substitution at these residues is selected from one or more of H138Y, T175I, V178I, A184V, N186D, F205C, D217V, S226I, S226R, R255S, A290V, A295T, L353V, and G396M. In preferred embodiments, the cytochrome P450 variant comprises the amino acid substitutions V78A, H138Y, T175I, V178I, A184V, H236Q, E252G, R255S, A290V, A295T, and L353V, or the amino acid substitutions V78A, T175I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, and L353V.

[0016] The invention also provides an isolated nucleic acid encoding a cytochrome P450 variant which has a higher organic solvent-resistance than the corresponding wild-type cytochrome P450, wherein the wild-type cytochrome P450 comprises an amino acid sequence has at least 60% sequence identity to SEQ ID NO: 2, and the cytochrome P450 variant comprises an amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from T235, R471, E494, and S1024. In one embodiment, the amino acid sequence has at least 80% sequence identity to SEQ ID NO: 2. Preferably, the amino acid sequence is SEQ ID NO: 2. The organic solvent-resistance may be, for example, a higher maximum turnover rate of a substrate into an oxidized product in a solution comprising an organic solvent, and the oxidized product may be a hydroxylated product. In another embodiment, the organic solvent is selected from THF, DMSO, methanol, ethanol, propanol, dioxane, and dimethylformamide. For example, when the solution comprises 25% (v/v) DMSO, the hydroxylation activity of the cytochrome P450 BM-3 variant can be at least twice the hydroxylation activity of wild-type cytochrome P450 BM-3. Also, when the solution comprises 2% (v/v) THF, the hydroxylation activity of the cytochrome P450 BM-3 variant can be at least twice the hydroxylation activity of wild-type cytochrome P450 BM-3.

[0017] In particular embodiments, the cytochrome P450 variant may comprise amino acid substitutions at residues corresponding to at least two residues of SEQ ID NO: 2 selected from T235, R471, E494, and S1024; preferably at least three residues, an the amino acid substitutions at residues may correspond to T235, R471, E494, and S1024 of SEQ ID NO: 2. Alternatively, when the wild-type sequence is SEQ ID NO: 2, the cytochrome P450 variant may comprise amino acid substitutions at least two residues selected from T235, R471, E494, and S1024, preferably three, and, even more preferably, the cytochrome P450 variant comprises amino acid substitutions at T235, R471, E494, and S1024. Optionally, the amino acid substitutions can be T235A, R471A, E494K, and S1024E. In any of the foregoing embodiment, the cytochrome P450 variant further may further comprise an amino acid substitution at residue F87, for example, F87A. In a preferred embodiment, the cytochrome P450 variant comprises the mutations F87A, T235A, R471A, E494K, and S1024E.

[0018] The invention also provides for an isolated nucleic acid encoding a variant of a parent cytochrome P450 oxygenase, the variant having (i) a higher ability than the parent to oxidize a substrate selected from an alkane comprising a carbon-chain of no more than 8 carbons or alkene comprising a carbon-chain of less than 8 carbons; and (ii) at least one amino acid substitution in a secondary structural element of the cytochrome P450 heme domain selected from the helix B′ domain, the helix H domain, and the helix I domain, wherein the parent comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO: 2. Preferably, the amino acid sequence of the parent has at least 80% sequence identity to SEQ ID NO: 2. Even more preferably, the amino acid sequence of the parent is SEQ ID NO: 2. The amino acid substitution can, for example, be at a residue corresponding to a residue of SEQ ID NO: 2 selected from V78, H236, and E252, and correspond to V78A, H236Q, or E252G. Optionally, the variant further comprises at least one amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from H138, T175, V178, A184, F205, S226, R255, A290, A295, and L353, such as, for example, V78A, H138Y, T175I, V178I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, A295T, and L353V. Preferably, the amino acid substitutions are at residues corresponding to amino acid residues V78, H236, and E252 of SEQ ID NO: 2, such as, for example, V78A, H236Q, and E252G.

[0019] The invention also provides for an isolated nucleic acid encoding a variant of a parent cytochrome P450 oxygenase, the variant having: (i) a higher organic solvent resistance than the parent; and (ii) at least one amino acid substitution in a secondary structural element of the selected from the helix H domain of the cytochrome P450 heme domain and the helix of the flavin domain; wherein the parent comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO: 2. In one embodiment, the amino acid sequence of the parent has at least 80% sequence identity to SEQ ID NO: 2. Preferably, the amino acid sequence of the parent is SEQ ID NO: 2. The amino acid substitution can be at a residue corresponding to a residue of SEQ ID NO: 2 selected from T235 and E494, and the cytochrome P450 variant may optionally comprise a further amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from R471 and S1024. If so, the amino acid substitution can be selected from T235A and E494K, and the further amino acid substitution can be selected from R471A and S1024E. In a preferred embodiment, the amino acid substitution is selected from T235A and E494K, and the further amino acid substitution is selected from R471A and S1024E. In another preferred embodiment, the variant further comprises an amino acid substitution at a residue corresponding to residue F87 of SEQ ID NO: 2, such as, for example, F87A, F87G, F87V, F87I, F87F, F87W, F87D, F87N, F87H, F87K, or F87R. A preferred variant comprises amino acid substitutions at residues corresponding to amino acid residues T235, R471, E494, and S1024 of SEQ ID NO: 2, and the amino acid substitutions are preferably T235A, R471A, E494K, and S1024E, with or without the amino acid substitution F87A.

[0020] The invention provides for an isolated nucleic acid encoding a cytochrome P450 variant, the cytochrome P450 variant comprising the amino acid substitutions V78A, H236Q, and E252G of SEQ ID NO: 2, wherein the variant may further comprise the amino acid substitutions H138Y, T175I, V178I, A184V, N186D, D217V, S226I, R255S, A290V, A295T, L353V, and G396M, or the amino acid substitutions T175I, A184V, F205C, S226R, R255S, A290V, L353V.

[0021] The invention also provides for an isolated nucleic acid encoding a cytochrome P450 variant, the cytochrome P450 variant comprising the amino acid substitutions T235A, R471A, E494K, and S1024E of SEQ ID NO: 2, optionally comprising the amino acid substitution F87A.

[0022] Finally, the invention also provides for amino acid sequences comprising the above-mentioned amino acid substitutions.

[0023] The above features and many other advantages of the invention will become better understood by reference to the following detailed description when taken in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0024]FIG. 1. Gas chromatogram of the oxidation products of octane, catalyzed by wild-type P450 BM-3.

[0025]FIG. 2. The screening assay for alkane oxidation activity uses the substrate analog 8-pnpane (p-nitrophenoxyoctane). Terminal hydroxylation generates the unstable hemiacetal, which decomposes to the aldehyde and p-nitrophenolate, which is monitored at 410 nm.

[0026]FIG. 3. P450 BM-3 screening procedure. Library of P450 BM-3 mutant genes is transformed into E. coli and plated on agar, from which single colonies are picked into 96-well plates and grown overnight. From these plates, samples are taken to inoculate fresh 96-well plates, in which the enzymes are expressed and assayed for hydroxylation activity. The plates from the overnight growth are stored at 4° C. and used to isolate active clones identified in the assay.

[0027]FIG. 4. Progression of activity for P450 BM-3, as measured on 8-pnpane.

[0028]FIG. 5. Initial rates of NADPH oxidation vs alkane concentration for purified P450 BM-3 mutant IX139-3. Octane: k_(cat)=35.6 s⁻¹, Km=9.1×10⁻⁴ M and k_(cat)/K_(m)=3.9×10⁴ M⁻¹s⁻¹. Hexane: k_(cat)=34.3 s⁻¹, K_(m)=4.9×10⁻⁴ M and k_(cat)/K_(m)=7.0×10⁴ M⁻¹s⁻¹. Cyclohexane: k_(cat)=31.0 s⁻¹, K_(m)=8.4×10⁻⁵ M and k_(cat)/K_(m)=3.7×10⁵ M⁻¹s⁻¹.

[0029]FIG. 6. Gas chromatogram of the oxidation products of hexane, catalyzed by IX79_(—)1.

[0030]FIG. 7. Gas chromatogram of the oxidation products of cyclohexane, catalyzed by IX79_(—)1.

[0031]FIG. 8. Gas chromatogram of the oxidation products of butane, catalyzed by IX139-3.

[0032]FIG. 9. Alternative representation of the pNCA assay principle (see also FIG. 2).

[0033]FIG. 10. Re-screen results of 1st mutant generation, investigating organic solvent resistance in 1% (v/v) (see Example 2).

[0034]FIGS. 11A and B. Mutagenic pathways for parent F87A and the co-solvents DMSO (A) and THF (B) (see Example 2), showing increased residual activity versus enzyme generation.

[0035]FIGS. 12A to C. Organic solvent resistance profile of the parent, showing organic solvent resistance against percentage of solvent (see Example 2). (A) DMSO as solvent. (B) THF as solvent. (C) Purified wild-type, F87A, and F87ASB3, using DMSO as solvent.

[0036]FIGS. 13A to F. Organic solvent resistance profile of back-mutated BM-3 variants. Solvent: A=DMSO; B=THF; C=acetone; D=acetonitrile; E=DMF; and F=ethanol.

[0037]FIG. 14. Maximum turnover rates (mole substrate/min/mole enzyme) for P450 BM-3 wildtype (black bars) and IX139-3 (shaded bars) on alkane and fatty acid substrates.

[0038]FIG. 15. Maximum rates reported for alkane hydroxylation by alkane monooxygenases CYP4B111, CYP52A38, P450cam9, AlkB18, and sMMO10. Rates for P450 BM-3 wildtype and mutant IX139-3 were determined in this work.

[0039]FIGS. 16A and B. Optical spectra for IX139-3 and wildtype P450 BM-3. (A) Mutant IX139-3 (0.5 mM) in potassium phosphate buffer (0.1 M, pH 8) (-▴-); with laurate (0.5 mM, solid line) in 1% methanol; hexane (1.0 mM, -▪-) in 1% methanol; propane (saturated solution, --). (B) Wildtype P450 BM-3 in potassium phosphate buffer (0.1 M, pH 8) (-▴-); with laurate (0.5 mM, solid line) in 1% methanol; hexane (1.0 mM, 1.0 mM, -▪-) in 1% methanol; propane (saturated solution, --).

[0040]FIG. 17. Initial rates of NADPH consumption (mole substrate/min/mole enzyme) in the presence of alkenes for wild-type (black bars) and X139-3 (shaded bars). “ND” indicates that NAPDH consumption was not detectable over background.

[0041]FIG. 18. Gas chromatogram of the oxidation products of styrene (A) and propene (B) catalyzed by IX139-3.

[0042]FIGS. 19A and B. (A) Absorption spectra in the presence of IX139-3 after the 4-NBP assay with isoprene and NADPH (∘), NADPH without isoprene (□), and isoprene without NADPH (Δ). (B) Absorption spectra in the presence of IX139-3 after the NBP assay with styrene and NADPH (∘), NADPH without styrene (□), and styrene without NADPH (Δ).

[0043]FIGS. 20A and B. Sequence alignments of P450 BM-3 (GenBank Accession No. P14779 (SEQ ID NO: 2)) with CYP 2C3 (GenBank P00182, SEQ ID NO: 3), CYP 2C9 (GenBank P11712; SEQ ID NO: 4), CYP 2D1v (GenBank P10633; SEQ ID NO: 5), and CYP 108 (GenBank P33006; SEQ ID NO: 6).

[0044]FIGS. 21A to F. Representative topology diagrams of the heme domain of P450 variants of the invention, based on P450BM-P; the heme domain of P450 BM-3. (A) topology of P450BM-P; the topology is depicted with helices represented by black bars, and the length of each of the bars is in approximate proportion to the length of the helix. The strands of β-sheets are shown with arrows. The strands are grouped by the secondary structural elements which they comprise. The structural elements are grouped into the α-helical-rich domain and the β-sheet-rich domain. The heme is shown by the square at the NH₂-terminal end of the L-helix. With only minor modifications, this topology diagram could be used for other P450s (Peterson et al., 1995). (B) Topology of P450 BM-3, showing location of the heme group. (C) Location of residues H138 and A78 relative to the heme group. (D) Location of residues T175, V178, A184, N186, and D217 relative to the heme. (E) Location of residues H236, R255, and E252 relative to the heme. (F) Location of residues L353, G396, A290, and A295 relative to the heme. Those sections labeled A, beta1, beta3, beta4, D, E, F, G, I, K, and J denote secondary structural elements conserved in P450s.

[0045]FIG. 22. Positions of the amino acid substitutions in P450 BM-3 mutant IX139-3. The variant contains 11 amino acid substitutions, which are represented as spheres on the crystal structure of the substrate-bound enzyme (PDB: 1FAG). Five are clustered on the highly flexible F-G helix-loop-helix structure and the I helix along which it slides during substrate binding and release.

DETAILED DESCRIPTION OF THE INVENTION

[0046] The invention provides novel variants or mutants of P450 BM-3 which are capable of oxidizing alkanes, alkane derivatives, and/or saturated or unsaturated hydrocarbons. Accordingly, the invention provides variant P450 BM-3 enzymes which have a higher oxidation activity towards at least one alkane, alkane derivative, alkene, or alkene derivative than wild-type P450 BM-3. Preferably, the variant P4590 BM-3 has a higher oxidation activity towards a saturated hydrocarbon such as octane, hexane, cyclohexane, propane, ethane, and/or butane, or towards an unsaturated hydrocarbon such as propene, hexene, cyclohexene, isoprene, allyl chloride, and/or styrene, or derivatives thereof, than wild-type P450 BM-3. In one embodiment, the P450 BM-3 variants comprise mutations at one or more of the following residues of SEQ ID NO: 2, counting the starting methionine as position 0 (“zero”): V78, H138, T175, V178, A184, N186, F205, D217, S226, H236, E252, R255, A290, A295, L353, and G396. Preferably, the mutation or mutations are selected from V78A, H138Y, T175I, V178I, A184V, N186D, F205C, D217V, S226I, S226R, H236Q, E252G, R255S, A290V, A295T, L353V, and G396M. The P450 BM-3 variants can comprise at least one, preferably at least three, and more preferably at least 5, and even more preferably at least all of these amino acid mutations. In a particularly preferred embodiment, the P450 BM-3 variant comprises the mutations V78A, H236Q, and the E252G mutations. See also Table 1A.

[0047] In addition, the invention provides P450 BM-3 variants with a higher organic solvent resistance in at least one water-miscible co-solvent than wild-type P450 BM-3. Preferably, the P450 BM-3 variants of the invention have a higher organic solvent resistance towards water-miscible co-solvents such as, but not limited to, DMSO, THF, methanol, ethanol, propanol, dioxane, and dimethylfonamide. In a preferred embodiment, the P450 BM-3 variant comprises mutations at one or more of the following residues of SEQ ID NO: 2, counting the starting methionine as position 0 (“zero”): F87, T235, R471, E494, and S1024. The mutation at F87 can be, for example, F87A, F87G, F87V, F87I, F87F, F87W, F87D, F87N, F87H, F87K, F87R. The mutations at T235, R471, and E494 can be, for example, T235A, R471C, R471A, and E494K. The mutation at S1024 can be, for example, S1024R, S1024T, S1024K, and S1024E. In a preferred embodiment, the novel P450 BM-3 variants comprise mutations in at least one, preferably at least three of amino acid residues F87, T235, R471, E494, and S1024. Most preferably, the variant comprises the mutations T235A, R471A, E494K, and S1024E, with or without the mutation F87A. See also Table 1 A. TABLE 1A Preferred Cytochrome P450 Mutated Amino Acid Residues and Mutations Amino Acid Residue of SEQ ID NO: 2 Amino Acid Mutation V78 V78A F87 F87A H138 H138Y T175 T175I V178 V178I A184 A184V N186 N186D F205 F205C D217 D217V S226 S226I, S226R T235 T235A H236 H236Q E252 E252G R255 R255S A290 A290V A295 A295T L353 L353V G396 G396M R471 R471A E494 E494K S1024 S1024E

[0048] In addition, the invention provides for P450 BM-3 mutants having specific nucleic acid and amino acid sequences. The nucleic acid sequences include those which encode for the P450 BM-3 variants in Table 1B. The amino acid sequences include those which have the combinations of amino acid mutations in Table 1B, where all mutations refer to SEQ ID NO: 2, counting the starting methionine residue as position 0 (“zero”). TABLE 1B Preferred P450 BM-3 Variants Amino Acid Mutations Designation in Wild-Type P450 BM-3 (SEQ ID NO: 2) ix139-3 V78A, H138Y, T175I, V178I, A184V, H236Q, E252G, R255S, A290V, A295T, L353V ix139-37 V78A, T175I, A184V, N186D, D217V, H236Q, E252G, A290V, L353V, G396M ix139_43 V78A, T175I, A184V, S226I, H236Q, E252G, A290V, L353V J V78A, T175I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, L353V F87A5F5 F87A, T235A, R471A, E494K, S1024E F87ABC1F10 F87A, T235A, R471A, S1024T W5F5 T235A, R471A, E494K, S1024E

[0049] Moreover, the invention provides for novel variants of P450 enzymes other than P450 BM-3, which have a higher activity for alkane oxidation and/or higher solvent resistance than the corresponding wild-type enzyme. These novel variants can be identified when aligned with the respective non-BM-3 amino acid sequence with that of P450 BM-3 (SEQ ID NO: 2), amino acid positions in the non-BM-3 sequence that are aligned with one or more of the following amino acid residues in SEQ ID NO: 2 are identified: V78, F87, H138, T175, V178, A184, N186, F205, D217, S226, T235, H236, E252, R255, A290, A295, L353, G396M, R471, E494, and S1024. Mutations in amino acid residues of the non-BM-3 enzyme which are aligned with and identical to the aforementioned BM-3 amino acid residues results in novel P450 variants according to the invention. See FIG. 20. Preferably, the mutation in the non-BM-3 sequence results in one or more of the following amino acid substitutions: V78′A, F87′A, F87′G, F87′V, F87′I, F87′F, F87′W, F87′D, F87′N, F87′H, F87′K, F87′R, H138′Y, T175′I, V178′I, A184′V, N186′D, F205′C, D217′V, S226′I, S226′R, T235′A, H236′Q, E252′G, R255′S, A290′V, A295′T, L353′V, G396′M, R471′C, R471′A, E494′K, S1024′R, S1024′T, S1024′K, and S1024′E, where the amino acid residue is the amino acid residue aligned with the corresponding P450- BM-3 residue (denoted by prime (′) sign). Preferred, non-limiting examples of such novel “non-BM-3” enzymes are listed in Table 2. TABLE 2 Preferred Non-P450 BM-3 Variants Wild-Type Wild-Type Amino Aligned P450 BM-3 Enzyme Acid Sequence Mutation(s) Mutation(s) CYP 2C3 SEQ ID NO: 3 H266Q H236′Q CYP2C9 SEQ ID NO: 4 E285G E252′G CYP 2D1v SEQ ID NO: 5 E296G, L398V E252′G, L353′V CYP 108 SEQ ID NO: 6 A293V A290′V

[0050] In addition, the invention provides for variants of non-BM-3 enzymes, wherein the wild-type sequences are at least 30, preferably at least 50, more preferably at least 70, even more preferably at least 90%, and optimally at least 95% identical to SEQ ID NO: 2. Preferred, non-limiting examples of such novel “non-BM-3 P450s” and their heme domains are described herein, listed in Table 2 and depicted in FIG. 20. In one embodiment, the oxidase activity of a P450 variant for one or more alkane- or alkene-substrates is at least five, more preferably at least 10, and even more preferably at least 15 times that of the corresponding wild-type cytochrome P450. In another embodiment, the organic solvent resistance of a P450 variant is at least two, preferably at least 3, and even more preferably at least five times that of the corresponding wild-type cytochrome P450.

[0051] In addition, the “non-BM-3 P450” may be a P450 BM-3 variant, which has one or more mutations as compared to wild-type P450 BM-3. Such variants, including variants displaying more than 60% sequence identity to SEQ ID NO: 2, are described in, e.g., PCT application PCT/US02/11954, filed Apr. 16, 2002.

Wild-Type Cytochrome P450 Enzymes

[0052] Crystal structures of wildtype P450 BM-3 with and without substrate reveal large conformational changes upon substrate binding at the active site (Haines et al., 2001; Li and Poulos, 1997; Paulsen and Ornstein, 1995; and Chang and Loew, 2000). The substrate free structure displays an open access channel with 17 to 21 ordered water molecules. Substrate recognition serves as a conformational trigger to close the channel, which dehydrates the active site, increases the redox potential, and allows dioxygen to bind to the heme.

[0053] The activity of P450 BM-3 on saturated fatty acids follows the order C15 =C16>C14>C17>C13>C18>C12 (Oliver et al., 1997). On the C16 fatty acid, k_(cat)=81 s⁻¹ and K_(m)=1.4×10⁻⁶ M (k_(cat)/K_(m)=6.0×10⁷ M⁻¹s⁻¹). With the C12 fatty acid, kcat=26 s⁻¹, K_(m)=136×10⁻⁶ M and k_(cat)/K_(m)=1.9×10⁵ M⁻¹s⁻¹ (Oliver et al., 1997).

[0054] P450 BM-3 may be compared to naturally-occurring enzyme that hydroxylates linear alkanes. For example, Pseudomonas oleovorans is able to oxidize n-alkanes using hydroxylase machinery comprising an integral membrane oxygenase (ω-hydroxylase), a soluble NADH-dependent reductase and a soluble metalloprotein (rubredoxin) which transfers electrons from the reductase to the hydroxylase (Staijen et al., 2000). The ω-hydroxylase has been cloned from P. oleovorans into Escherichia coli, where it has been expressed and purified (Shanklin et al., 1997). The specific activity of this ω-hydroxylase for octane (5.2 units/mg hydroxylase (Shanklin et al., 1997)) is ˜13 times greater than that of P450 BM-3 (0.4 units/mg enzyme) (See Example 1). (The specific activity of the complete P. oleovorans system, including the rubredoxin and the reductase, is of course less than 5.2 units/mg). Thus, wildtype P450 BM-3 is inefficient relative to this (and other) naturally occurring enzyme for alkane hydroxylation.

[0055] A tyrosine (Tyr51) at the entrance to the substrate-binding pocket makes a hydrogen bond to the carboxylate group of the substrate in the crystal structure of the enzyme bound with palmitoleic acid (Li and Poulos, 1997). Arg 47, also at the entrance to the binding pocket, may form an ionic interaction as well. Nonpolar alkane substrates must rely solely on hydrophobic partitioning into the enzyme's extended substrate channel, and poor substrate recognition may contribute to P450 BM-3's sluggish activity on octane and other alkanes or alkenes.

Directed Evolution

[0056] The present invention provides evolved enzymes which oxidize alkanes to a higher degree, or which have a higher resistance to organic solvents and/or co-solvents, than the corresponding wild-type enzyme(s). As described in Example 3, a P450 BM-3 variant according to the invention has been generated which surpasses the activity of the alkane hydroxylase from P. oleovorans on octane. The mutant also showed similar high activity on hexane, cyclohexane, and pentane, which was not shown to be a substrate for P450 BM-3 before, and is also efficient on butane and propane. Thus, the cytochrome P450 variants of the invention show that it is possible to reach activities on unreactive nonnatural substrates that are close to the activity of the native enzyme, e.g., P450 BM-3, on its best natural substrates (for P450 BM-3; long chain fatty acids), which are about 1000 times higher than that of eukaryotic P450s and one of the highest activities of P450s known so far.

[0057] The strategy described here also provides improvements of the P450 BM-3 activity on a number of substrates, including shorter chain alkenes such as propene, allyl chloride, isoprene, 1-hexene and styrene. It should also be possible to target other key properties such as regioselectivity, enantioselectivity and catalyst stability.

[0058] A preferred technique to improve the alkane-oxidation and co-solvent resistance of wild-type or parent cytochrome P450 enzymes, including P450 BM-3, is directed evolution. General methods for generating libraries and isolating and identifying improved proteins according to the invention using directed evolution are described briefly below. More extensive descriptions can be found in, for example, Arnold (1998); U.S. Pat. Nos. 5,741,691; 5,811,238; 5,605,793 and 5,830,721; and International Applications WO 98/42832, WO 95/22625, WO 97/20078, WO 95/41653 and WO 98/27230.

[0059] The basic steps in directed evolution are (1) the generation of mutant libraries of polynucleotides from a parent or wild-type sequence; (2) (optional) expression of the mutant polynucleotides to create a mutant polypeptide library; (3) screening/selecting the polynucleotide or polypeptide library for a desired property of a polynucleotide or polypeptide; and (4) selecting mutants which possess a higher level of the desired property; and (5) repeating steps (1) to (5) using the selected mutant(s) as parent(s) until one or more mutants displaying a sufficient level of the desired activity have been obtained. The property can be, but is not limited to, alkane oxidation capability and solvent-resistance.

[0060] The parent protein or enzyme to be evolved can be a wild-type protein or enzyme, or a variant or mutant. The parent polynucleotide can be retrieved from any suitable commercial or non-commercial source. The parent polynucleotide can correspond to a full-length gene or a partial gene, and may be of various lengths. Preferably the parent polynucleotide is from 50 to 50,000 base pairs. It is contemplated that entire vectors containing the nucleic acid encoding the parent protein of interest may be used in the methods of this invention.

[0061] Any method can be used for generating mutations in the parent polynucleotide sequence to provide a library of evolved polynucleotides, including error-prone polymerase chain reaction, cassette mutagenesis (in which the specific region optimized is replaced with a synthetically mutagenized oligonucleotide), oligonucleotide-directed mutagenesis, parallel PCR (which uses a large number of different PCR reactions that occur in parallel in the same vessel, such that the product of one reaction primes the product of another reaction), random mutagenesis (e.g., by random fragmentation and reassembly of the fragments by mutual priming); site-specific mutations (introduced into long sequences by random fragmentation of the template followed by reassembly of the fragments in the presence of mutagenic oligonucleotides); parallel PCR (e.g., recombination on a pool of DNA sequences); sexual PCR; and chemical mutagenesis (e.g., by sodium bisulfite, nitrous acid, hydroxylamine, hydrazine, formic acid, or by adding nitrosoguanidine, 5-bromouracil, 2-aminopurine, and acridine to the PCR reaction in place of the nucleotide precursor; or by adding intercalating agents such as proflavine, acriflavine, quinacrine); irradiation (X-rays or ultraviolet light, and/or subjecting the polynucleotide to propagation in a host cell that is deficient in normal DNA damage repair function); or DNA shuffling (e.g., in vitro or in vivo homologous recombination of pools of nucleic acid fragments or polynucleotides). Any one of these techniques can also be employed under low-fidelity polymerization conditions to introduce a low level of point mutations randomly over a long sequence, or to mutagenize a mixture of fragments of unknown sequence.

[0062] Once the evolved polynucleotide molecules are generated they can be cloned into a suitable vector selected by the skilled artisan according to methods well known in the art. If a mixed population of the specific nucleic acid sequence is cloned into a vector it can be clonally amplified by inserting each vector into a host cell and allowing the host cell to amplify the vector and/or express the mutant or variant protein or enzyme sequence. Any one of the well-known procedures for inserting expression vectors into a cell for expression of a given peptide or protein may be utilized. Suitable vectors include plasmids and viruses, particularly those known to be compatible with host cells that express oxidation enzymes or oxygenases. E. coli is one exemplary preferred host cell. Other exemplary cells include other bacterial cells such as Bacillus and Pseudomonas, archaebacteria, yeast cells such as Saccharomyces cerevisiae, insect cells and filamentous fungi such as any species of Aspergillus cells. For some applications, plant, human, mammalian or other animal cells may be preferred. Suitable host cells may be transformed, transfected or infected as appropriate by any suitable method including electroporation, CaCl₂ mediated DNA uptake, fungal infection, microinjection, microprojectile transformation, viral infection, or other established methods.

[0063] The mixed population of polynucleotides or proteins may then be tested or screened to identify the recombinant polynucleotide or protein having a higher level of the desired activity or property. The mutation/screening steps can then be repeated until the selected mutant(s) display a sufficient level of the desired activity or property. Briefly, after the sufficient level has been achieved, each selected protein or enzyme can be readily isolated and purified from the expression system, or media, if secreted. It can then be subjected to assays designed to further test functional activity of the particular protein or enzyme. Such experiments for various proteins are well known in the art, and are described below and in the Examples below.

[0064] The evolved enzymes can be used in biocatalytic processes for, e.g., alkane hydroxylation and alkene epoxidation, or for improving yield of reactions involving oxidation of substrates with low solubility in aqueous solutions. The enzyme variants of the invention can be used in biocatalytic processes for production of chemicals from hydrocarbons, particularly alkanes and alkenes, in soluble or immobilized form. Furthermore, the enzyme variants can be used in live cells or in dead cells, or it can be partially purified from the cells. One preferred process would be to use the enzyme variants in any of these forms (except live cells) in an organic solvent, in liquid or even gas phase, or for example in a super-critical fluid like CO₂. The organic solvent would dissolve high concentrations of the non-polar substrate, so that the enzyme could work efficiently on that substrate.

[0065] Recycling the cofactor can present difficulties for such a process. However, cofactor recycling methods well known in the art can be applied. For example, an enzyme capable of regenerating the cofactor, using a second substrate can be used. Alternatively, the enzyme can be used in living cells, and the cofactor recycling can be accomplished by feeding the cells the appropriate substrate(s). The NADPH and oxygen can also be replaced by a peroxide, for example hydrogen peroxide, butyl peroxide or cumene peroxide, or by another oxidant. Mutations that enhance the efficiency of peroxide-based oxidation by BM-3 or other cytochrome P450 enzymes can serve to enhance the peroxide shunt activity of the enzyme variants described here. The mutations described here can be combined with such mutations, for example, and tested for their contributions to peroxide-driven alkane and alkene oxidation.

Screening Assays

[0066] The method of screening for identifying mutants or variants, for further testing or for the next round of mutation, will depend on the desired property sought. For example, in this invention, recombinant nucleic acid which encode cytochrome P450 enzymes with improved alkane-oxidation capability and/or solvent-resistance can be screened for alkane-oxidation activity or for activity or stability in a solvent/co-solvent mixture. Such tests are well known in the art. Examples of suitable tests are provided in the Examples and discussed below.

[0067] In a broad aspect, a screening method to detect oxidation comprises combining, in any order, substrate, oxygen donor, and test oxidation enzyme. The assay components can be placed in or on any suitable medium, carrier or support, and are combined under predetermined conditions. The conditions are chosen to facilitate, suit, promote, investigate or test the oxidation of the substrate by the oxygen donor in the presence of the test enzyme, and may be modified during the assay. The amount of oxidation product, i.e., oxidized substrate, is thereafter detected using a suitable method. Further, as described in WO 99/60096, a screening method can comprise a coupling enzyme such as horseradish peroxidase to enable or enhance the detection of successful oxidation. In some embodiments, one or more cofactors, coenzymes and additional or ancillary proteins may be used to promote or enhance activity of the test oxidation enzyme, coupling enzyme, or both.

[0068] In a preferred embodiment, it is not necessary to recover test enzyme from host cells that express them, because the host cells are used in the screening method, in a so-called “whole cell” assay. In this embodiment, substrate, oxygen donor, and other components of the screening assay, are supplied to the transformed host cells or to the growth media or support for the cells. In one form of this approach, the test enzyme is expressed and retained inside the host cell, and the substrate, oxygen donor, and other components are added to the solution or plate containing the cells and cross the cell membrane and enter the cell. Alternatively, the host cells can be lysed so that all intracellular components, including any recombinantly expressed intracellular enzyme variant, can be in direct contact with any added substrate, oxygen donor, and other components.

[0069] Resulting oxygenated products are detected by suitable means. For example, the oxidation product may be a colored, luminescent, or fluorescent compound, so that transformed host cells that produce more active oxidation enzymes “light up” in the assay and can be readily identified, and can be distinguished or separated from cells which do not “light up” as much and which produce inactive enzymes, less active enzymes, or no enzymes. A fluorescent reaction product can be achieved, for example, by using a coupling enzyme, such as laccase or horseradish peroxidase, which forms fluorescent polymers from the oxidation product. A chemiluminescent agent, such as luminol, can also be used to enhance the detectability of the luminescent reaction product, such as the fluorescent polymers. Detectable reaction products also include color changes, such as colored materials that absorb measurable visible or UV light.

[0070] To improve the activity of P450 BM-3 or other cytochrome P450 enzymes towards alkanes by directed evolution, a rapid, reproducible screen that is sensitive to small changes (<2-fold) in activity is desirable (Arnold, 1998). Therefore, an alkane analog such as 8-pnpane (see FIG. 2 and Example 1), can be prepared that generates yellow color upon hydroxylation. This “surrogate” substrate with a C8 backbone and a p-nitrophenyl moiety is an analog of octane, and allows use of a colorimetric assay to conveniently screen large numbers of P450 BM-3 or other cytochrome P450 variants mutants for increased hydroxylation activity in microtiter plates (Schwaneberg et al., 1999; Schwaneberg et al., 2001). Hydroxylation of 8-pnpane generates an unstable hemiacetal which dissociates to form (yellow) p-nitrophenolate and the corresponding aldehyde (FIG. 2). The hydroxylation kinetics of hundreds of mutants can then be monitored simultaneously in the wells of a microtiter plate using a plate reader (FIG. 3) (Schwaneberg et al., 2001). This method is particularly suitable for detecting P450 variant with improved alkane-oxidation activity.

[0071] To screen for improved solvent-resistance, in particular for P450 BM-3 variants, a substrate such as 12-pNCA can be added together with an organic co-solvent (e.g., tetrahydrofurane (THF), DMSO, ethanol, methanol, acetone, etc.) and 12-pNCA conversion initiated by adding a Isocitric co-factor regeneration solution (e.g., Isocitric acid 20 mM; dH₂O, NADP+3 mM, Isocitric dehydrogenase 0.8 U/ml). After visible color development, the reaction can be stopped by adding UT-buster (NaOH 1.5 M, 1.5 M Urea, 50% (v/v) DMSO), and absorption at 410 nm recorded.

[0072] Enzyme variants displaying improved levels of the desired activity or property in the screening assay(s) can then be expressed in higher amounts, retrieved, optionally purified, and further tested for the activity or property of interest.

Activity Assays

[0073] The cytochrome P450 variants created by directed evolution and selected for a desired property or activity can be further evaluated by any suitable test or tests known in the art to be useful to assess the property or activity. For example, the enzyme variants can be evaluated for their alkane-oxidation capability, alkene-oxidation capability, and/or organic-solvent resistance.

[0074] An assay for alkane-oxidation capability essentially comprises contacting the cytochrome P450 variant with a specific amount of alkane substrate, or a substrate which is an alkane analog such as 8-pnpane, in the presence of an oxygen donor, and any other components (e.g., NADPH) that are necessary or desirable to include in the reaction mixture, such as NADPH and buffering agents. After a sufficient incubation time, the amount of oxidation product formed, or, alternatively, the amount of intact non-oxidized substrate remaining, is estimated. For example, the amount of oxidation product and/or substrate could be evaluated chromatographically, e.g., by mass spectroscopy (MS) coupled to high-pressure liquid chromatography (HPLC) or gas chromatography (GC) columns, or spectrophotometrically, by measuring the absorbance of either compound at a suitable wavelength. By varying specific parameters in such assays, the Michaelis-Menten constant (K_(m)) and/or maximum catalytic rate (V_(max)) can be derived for each substrate as is well known in the art. Preferred substrates include, but are not limited to, methane, ethane, propane, butane, pentane, hexane, heptane, octane, and cyclohexane. In addition, in particular by HPLC and GC techniques, particularly when coupled to MS, can be used to determine not only the amount of oxidized product, but also the identity of the product. For example, octane can be oxidized to octanol where the hydroxyl group is positioned on any of the carbon atoms in the octanol molecule.

[0075] Alkene-oxidation can be evaluated by methods similar to those described for alkanes, simply by replacing an alkane with the corresponding alkene, and designing an assay which promotes and detects epoxide formation of the alkene. For example, an assay which detects NADPH consumption may be used. Preferred alkene substrates include ethene, propene, butene, pentene, hexene, heptene, and octene.

[0076] Organic solvent resistance of a cytochrome P450 variant is advantageously evaluated by conducting an oxidation reaction in the presence of a certain amount of organic solvent or co-solvent. This amount can be varied from, e.g., about 0.1% to about 99.9% (v/v) organic solvent or co-solvent, more preferably from about 0.5% to about 50% (v/v) organic solvent or co-solvent, and, most preferably, from about 1% to about 10% (v/v) organic solvent or co-solvent, of the total reaction volume. The amount of oxidation product is then detected as a measure of the organic-solvent resistance of the enzyme variant. Such assays can be conducted using various amounts of solvent or co-solvent, and on enzyme variants stored for various periods of time in solutions comprising a certain amount of organic solvent or co-solvent. Preferred organic co-solvents include THF, DMSO, acetone, acetonitrile, and ethanol.

P450 BM-3 Variants

[0077] Described herein are several mutations that have been identified to improve the alkane-oxidation activity and/or alkene-oxidation activity. Thus, a P450 BM-3 variant of the invention can comprise at least one of these mutations, optionally in combination with another mutations selected from the ones described in Table 1A, a mutation not described in Table 1A, or no other mutation. The variant P450 BM-3 enzymes of the invention can have a higher oxidation activity towards a saturated hydrocarbon, e.g., octane, hexane, cyclohexane, propane, ethane, and/or butane, than wild-type P450 BM-3. Preferred amino acid mutations are those listed in Table 1A. The skilled artisan could easily identify P450 BM-3 variants, including variants comprising truncated, deleted, and inserted amino acid sequences, that comprise one or more of these mutations and that show enhanced alkane-oxidation activity in a suitable assay as compared to wild-type P450 BM-3.

[0078] As identified in Example 3, the particularly active P450 BM-3 mutants LX139-3 and “J” comprised 11 and 10 amino acid mutations, respectively; V78A, H138Y, T175I, V178I, A184V, H236Q, E252G, R255S, A290V, A295T, and L353V for IX139-3; and V78A, T175I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, and L353V for “J”. Using wildtype P450 BM-3 as the parent, a clone with approximately 2-fold increased activity for 8-pnpane was isolated. A single base mutation, threonine (T) to glycine (G), was found, which resulted in the amino substitution H236Q. Using H236Q as the parent for the 2nd generation, 9 mutants were identified in the 2nd generation with at least a 2-fold increase in activity. Recombining the 9 clones from the 2nd generation, a single clone was identified with 3 amino acid substitutions at V78A, H236Q, and E252G. These mutations, were noted as particularly effective mutations for improving alkane-oxidation activity, since these mutations were preserved throughout the evolutionary process. For example, H236Q was found in the first generation of mutants, and was preserved throughout the process, and the mutations V87A and E252G were discovered in the recombination of 9 clones from the 2^(nd) generation mutants, and thereafter preserved. Accordingly, a P450 BM-3 mutant comprising at least one, preferably at least two, and most preferably all three of these mutations, or a nucleic acid encoding such mutants, is a preferred embodiment of the invention.

[0079] In addition, the invention provides P450 BM-3 variants with a higher organic solvent resistance, especially towards water-miscible co-solvents such as DMSO and THF. A P450 BM-3 variant with improved organic solvent-resistance according to the invention comprises at least one of the mutations in Table 1A, optionally in combination with other mutations selected from the ones described in Table 1A, a mutation not described in Table 1A, or no other mutation. As noted in Example 2, P450 BM-3 variants having particularly improved solvent resistance (F87A5F5 and W5F5) comprised R471A, E494K, and S1024E mutations, optionally with the mutation F87A. Accordingly, a P450 BM-3 mutant comprising at least one, preferably at least two, and most preferably all three of these mutations, with or without the F87A mutation, or a nucleic acid encoding such mutants, is a preferred embodiment of the invention. A skilled artisan could easily identify P450 BM-3 variants, including variants comprising truncated, deleted, and inserted amino acid sequences, that comprise one or more of the mutations in Table 1A and that show improved organic solvent-resistance in a suitable assay as compared to wild-type P450 BM-3.

[0080] Directed evolution techniques can thus significantly improve the organic solvent resistance of P450 BM-3 and its mutant P450 BM-3 F87A toward DMSO and THF, or any other organic solvents or co-solvents. It was also identified herein that position F87—located at the end of the substrate access channel directly above the heme—plays a size-dependent key role in modulating the monooxygenase activity in organic co-solvents. Because the mutations are located at the interface between the reductase domain and the P450 domain, it is believed, while not being limited to any theory, that the mutations create tighter domain bonding, which support electron transfer from the reductase to the heme in the presence of cosolvent.

[0081] Notably, except for mutation 1024 (which is not in the crystallized reductase fragment) all of the mutations improving organic solvent resistance are located at the interface between heme domain and reducase. It is therefore possible that the mutations “stabilize” the orientation of heme to reducase allowing an electron transfer from the reducase to the heme. Potentially, and without being bound to any theory, this electron-transfer could be essential, or at least important, for activity.

[0082] Preferably, the P450 BM-3 variants of the invention have an at least two-fold improvement in the capability to oxidize a chosen alkane (e.g., octane, hexane, pentane, butane, cyclohexane, or propane), and/or an at least two-fold improvement in the organic solvent-resistance, as compared to wild-type P450 BM-3. Even more preferably, the improvement for either or both of these properties is at least 3-fold, at least 4-fold, at least 5-fold, at least 10-fold, or at least 15-fold.

[0083] The P450 mutations of the present invention are wholly unexpected. For example, Schmid and coworkers recently described the engineering of P450 BM-3 to hydroxylate short chain (C8-C10) fatty acids not accepted by the wildtype enzyme (Li et al., 2001). Guided by the x-ray crystal structure, they focused on eight individual amino acids within the binding pocket to perform saturation mutagenesis. This approach successfully discovered mutations (Val26Thr, Arg47Phe, Ala74Gly, Leu188Lys, Phe87Ala) that altered the enzyme∝s substrate specificity. An indole-hydroxylating triple mutant of BM-3 (Phe87Val, Leu188Gln, Ala74Gly) was also shown to bydroxylate octane (Appel et al., 2001). Random mutagenesis, however, can discovery other, often subtle ways to modify activity. For example, sequencing the gene for the P450 BM-3 variants (Table 1) revealed that none of the more active mutants described herein contained any of the mutations found by Schmid and coworkers. Furthermore, the best mutants described here were considerably more active on octane than the triple mutant reported by Schmid and coworkers (Appel et al., 2001).

[0084] Attempts to engineer catalyst specificity are often limited to altering amino acids directly involved in substrate recognition and binding. Only one residue that is in direct contact with substrate in the wildtype enzyme has been mutated in IX139-3 (V78A). In fact, many of the mutations discovered herein were not found in the substrate binding channel, as suggested by the structure of P450 BM-3 with the bound fatty acid substrate. Some are in the hydrophobic core of the protein. Furthermore, several of the mutations are found in the F and G helices and the loop connecting them. This region undergoes the largest movements upon substrate binding (Li and Poulos, 1997). Mutations in this region may be responsible for new binding characteristics and activity for alkanes. How these particular substitutions enhance activity towards alkanes is not clear. They would have been very difficult to identify using currently available structure-based design methods. The cytotoxicity of the new variants indicates that enzymes showing such high alkane hydroxylation activity may not be produced by living cells under natural conditions and could only be generated by laboratory evolution strategies.

Non-P450 BM-3 Variants

[0085] As described above, the invention also provides for novel non-P450 BM-3 cytochrome P450 oxygenases in which one or more of the amino acid residues listed in Table 1A have been conserved. Conservation of an amino acid residue can show that the residue has an important function for the oxygenase activity and/or stability of the P450 enzyme. Thus, the P450 BM-3 mutations identified herein to improve alkane-oxidation activity and/or solvent resistance can simply be translated onto such non-P450 BM-3 enzymes to yield improved properties according to the invention.

[0086] Any method can be used to “translate” the P450 BM-3 mutation onto another cytochrome P450 enzyme, and such methods are well known in the art. For example, sequence alignment software such as SIM (alignment of two protein sequences), LALIGN (finds multiple matching subsegments in two sequences), Dotlet (a Java applet for sequence comparisons using the dot matrix method); CLUSTALW (available via the World Wide Web as freeware), ALIGN (at Genestream (IGH)), DIALIGN (multiple sequence alignment based on segment-to-segment comparison, at University of Bielefeld, Germany), Match-Box (at University of Namur, Belgium), MSA (at Washington University), Multalin (at INRA or at PBIL), MUSCA (multiple sequence alignment using pattern discovery, at IBM), and AMAS (Analyse Multiply Aligned Sequences). A person of skill can choose suitable settings, or simply use standard default settings, in these programs to align P450 BM-3 with another cytochrome P450 enzyme. See FIG. 20 for representative sequence alignments, and Table 2 for representative non-P450 BM-3 mutations.

[0087] Alternatively, such sequence alignments of P450 BM-3 with other cytochrome P450 enzymes can be taken from the literature, and amino acid residues corresponding to the mutated amino acid residues of the invention identified. For example, such information can be derived from de Montellano (1995) (see, especially, FIG. 1 on page 187).

[0088] While some P450 enzymes may not share significant sequence similarities, particular domains such as the heme-containing domains of P450s do display close structural similarity (Miles et al., 2000). Therefore, the positions of the various mutations described here could be translated to similar positions in different P450s having very low sequence similarity to P450 BM-3 using molecular modeling of those P450s based on sequence homology. Examples of using such techniques to model various P450s based on sequence homology with P450 BM-3 are available (Lewis et al., 1999). The same mutations described here, when placed in their corresponding positions in other P450 structures (as determined by modeling) would confer similar improvements in alkane-/alkene oxidation activity and organic solvent resistance.

[0089] In this regard, FIG. 21 shows a topological view of a cytochrome P450 enzyme, including the various domains of cytochrome P450 enzymes and the mutations contemplated by the present invention in each of those domains. While the topological view presented in FIG. 21 is that of P450_(BM-P), with only minor modifications, this topology diagram may be used for other P450s. Briefly, FIG. 10 shows where mutations disclosed herein were made and these are summarized in Table 3 below. TABLE 3 Locations of Selected P450 BM-3 Mutations Domain Amino Acid Residue Heme domain: helix B′ V78 Heme domain: loop connecting helices B′ and C F87 Heme domain: β3-1 H138 Heme domain: helix F T175 Heme domain: helix F V178 Heme domain: helix F A184 Heme domain: helix F N186 Heme domain: helix G D217 Heme domain: helix G S226 Heme domain: helix H H236 Heme domain: helix I E252 Heme domain: helix I R255 Heme domain: helix J A290 Heme domain: helix J A295 Heme domain: β1-3 L353 Heme domain: loop connecting helices K′ and L G396 — R471 Helix in flavin domain E494 — S1024

[0090] Therefore, based on the topological view presented in FIG. 10, a P450 variant may be prepared by making one or more mutations in one or more of the domains of P450 identified in Table 3 above. Further, the topological view of FIG. 10 allows one to compare BM-3 variants with other P450 enzymes and identify those residues of non-BM-3 enzymes that could be mutated according to the secondary and tertiary structural motifs within the enzyme(s).

[0091] Thus, the invention provides novel non-P450 BM-3 cytochrome P450 oxygenases in which one or more of the amino acid residues listed in Table 1A have been conserved. Conservation of an amino acid residue can show that the residue has an important function for the oxygenase activity and/or stability of the P450 enzyme. The P450 BM-3 mutations identified herein to improve utilization of hydrogen peroxide as oxygen source and/or thermostability can simply be translated onto such non-P450 BM-3 enzymes to yield improved properties according to the invention.

[0092] Once the corresponding amino acid residues have been identified, a person of skill can test various mutations of these amino acid residues to identify those that yield improved alkane-oxidation capability or improved organic solvent-resistance as compared to the cytochrome P450 wild-type enzyme. Preferred amino acid substitutions are those that correspond to a substitution listed in Table 1A for P450 BM-3 mutations.

EXAMPLES

[0093] The invention is illustrated in the following examples, which are provided by way of illustration and are not intended to be limiting.

Example 1 Directed Evolution of a Cytochrome P450 Monooxygenase for Alkane Oxidation

[0094] This Example describes the discovery of P450 BM-3 variants. P450 BM-3 variants were created and identified by directed evolution techniques, specifically iterative cycles of random mutagenesis and recombination, and functional screening.

[0095] All chemical reagents were procured from Aldrich, Sigma, or Fluka. Enzymes were purchased from New England Biolabs, Stratagene, and Boehringer Mannheim. The 1H NMR spectrum was recorded on a Varian 300 MHz nuclear magnetic resonance spectrometer with a Mercury console. Quantitative Technologies Inc. (Whitehouse, N.J.) performed elemental analysis.

A. Random Library Generation and Screening of P450 BM-3

[0096] P450 BM-3 modified to contain a His6 tag was amplified from pT-USC1 BM-3 (Schwaneberg et al., Anal Biochem 1999a, 269:359-366) by PCR techniques using a proofreading polymerase Pfu to introduce a BamHI upstream of the start codon and an EcoRI site immediately downstream from the stop codon. The two oligonucleotides used were as follows: (BamHI site underlined, SEQ ID NO:7) 5′-CGCGGATCCATCGATGCTTAGGAGGTCATATGACAATTAAAGAAATG CCTC-3′ (EcoRI site underline, SEQ ID NO:8) 5′-CCGGAATTCTTAATGATGATGATGATGATGCCCAGCCCACACGTCTT TTGC-3′

[0097] The PCR product was digested with BamHI and EcoRI. The P450 BM-3 gene was ligated into expression vector pCWOri (+) (Barnes, 1996) (p BM-3 WT18-6), which is under the control of double Ptac promoter and contains an ampicillin resistance coding region. A silent mutation was introduced to construct a SacI site 130 bases upstream of the end of the heme domain. The QuikChange (Stratagene) protocol was followed and the primers were as follows: (SacI site underlined; SEQ ID NO:9) 5′-ACATACAAACTACGAGCTCGATATTAAAGAAAC-3′ (SacI site underlined; SEQ ID NO:10) 5′-GTTTCTTTAATATCGAGCTCGTAGTTTGTATG-3′

Synthesis of p-nitrophenoxyoctane (8 pnpane)

[0098] 1-bromooctane (1 g, 5.18 mmole) and 4-nitrophenol, sodium salt (0.92 g, 5.71 mmole) were refluxed in DMSO (30 ml) at 120° C. for 5 hours. The DMSO was distilled off to near dryness. The resulting brown residue was loaded onto a silica column and eluted with 10:1 mixture of petroleum ether and diethylether. The yield was 30%. 1H NMR (CDCl₃, peaks at=8.18 (m, 2H), 6.93 (m, 2H), 4.04 (t, 2H), 1.81 (p, 2H), 1.33 (m, 10H), 0.89 (t, 3H). Elemental analysis calculated for C₁₄H₂₁O₃N: C, 66.91; H, 8.42; N, 5.57. Found: C, 66.97; H, 8.34; N 5.52.

Expression and Purification of P450 BM-3 Variants

[0099] The P450 BM-3 gene, which includes a silent mutation to introduce a SacI site 130 bp upstream of the end of the heme domain, was cloned behind the double tac promoter of the expression vector pCWori (p BM-3_WT18 6) (Farinas et al., 2001). This plasmid was used for production of wildtype protein and as a starting clone for directed evolution. For protein production, Terrific Broth (TB) media (500 ml) supplemented with trace elements (125 μL: 0.5 g MgCl₂, 30.0 g FeCl₂ 6H₂O, 1.0 g ZnCl₂ 4H₂O, 0.2 g CoCl₂ 6H2O, 1.0 g Na₂MoO₄ 2H₂O, 0.5 g CaCl₂ 2H₂O, 1.0 g CuCl₂ and 0.2 g H₂BO₃ in 1L HCl solution (90% v/v distilled water:concentrated HCl)) (Joo et al., 1999) was inoculated with 500 μl of an overnight culture of E. coli BL21 containing the expression plasmid. After shaking for 10 hours at 35° C. and 250 rpm,. aminolevulinic acid hydrochloride (ALA) (0.5 mM) was added, and expression was induced by addition of IPTG (1 mM) and cells were harvested by centrifugation after a total cultivation time of 30 hours.

[0100] After 30 hours, the cells were harvested by centrifugation and the supernatants discarded. The pellets were washed with Tris HCl (15 ml, pH 8.3). Cells were resuspended in Tris HCl (15 ml, pH 8.3), sonicated (2×45 sec; output control=7, duty cycle 40%; Sonicator, Heat Systems Ultrasonic, Inc.) and centrifuged. The supernatants were further cleared through a 0.45 μM filter. The filtrate was diluted 15 mL water and purified by published procedures (Farinas 2001). P450 BM-3 concentrations were measured from the CO difference spectra (Omura 1963).

Library Construction

[0101] For the first 2 generations, mutagenic PCR was performed on the heme domain in a 100 mL reaction as described in Zhao et al. (1999) with some modifications. The mutated P450 BM 3 fragment was 1291 base pairs. The reaction contained MgCl₂ (7 mM) and the following forward and reverse primers (40 pmol each): (Forward, SEQ ID NO:11) 5′ ACAGGATCCATCGATGCTTAGGAGGTCATATG 3′ (Reverse, SEQ ID NO:12) 5′ GTGAAGGAATACCGCCAAG 3′.

[0102] The reaction also contained p BM-3 WT18-6 (10 ng), dNTPs (0.2 mM dGTP, 0.2 mM dATP, 1 mM dCTP, 1 mM dTTP), and Taq polymerase (5 units, Roche), KCl (50 mM), and Tris-HCl (10 mM, pH 8.3, 20° C.). MnCl2 (0.0, 0.05, and 0.1 mM) was added to the PCR mixture to alter the error rate of the polymerase. PCR was performed in a thermocycler (PTC200, MJ Research, Waltham, Mass.) for 30 cycles (95° C., 45 s; 50° C., 30 s; 72° C., 2 min). The PCR product was restricted with BamHI and SacI and ligated into expression vector pCWOri (+). The resulting plasmid was transformed into E. coli strain DH5α and the colonies were selected on agar plates containing ampicillin (100 mg/ml).

[0103] For the 3^(rd) generation, 9 mutants from the 2^(nd) generation showing at least 2-fold improved activity on 8-pnpane were recombined by staggered extension process (StEP) (Zhao H M, et al. Nature Biotechnol 1998;16:258 261). Recombination was done in 50 μL reactions as described (Zhao et al., 1999). Each reaction contained buffer (Qiagen 1×PCR buffer), template (10 ng each), forward and reverse primer (final 0.15 mM), dNTPs (200 μM each) and Taq DNA polymerase (2.5 units, Qiagen). PCR was performed in a thermocycler (PTC200, MJ Research, Waltham, Mass.) (1.95° C., 2 min; 2.95° C. 30 s; 3.50° C., 10 s; repeat steps 2 and 3 100×). The PCR product was restricted with BamHI and SacI and ligated into expression vector pCWOri (+). The resulting plasmid was transformed into E. coli strain DH5α and the colonies were selected on agar plates containing ampicillin (100 mg/ml).

[0104] For the 4^(th) and 5^(th) generation, error prone PCR was also performed using the GeneMorph PCR Mutagenesis Kit (Stratagene) applying conditions of high error rate (1-10 ng template DNA). The PCR product was restricted with BamHI and SacI and ligated into expression vector pCWOri (+). The resulting plasmid was transformed into E. coli strain DH5α and the colonies were selected on agar plates containing ampicillin (100 mg/ml).

[0105] Recombination was done in 50 mL reactions as described (Zhao et al., 1999). Each reaction contained buffer (Qiagen 1×XPCR buffer), template (10 ng each), forward and reverse primer (final 0.15 mM), dNTPs (200 μM each) and Taq DNA polymerase (2.5 units, Qiagen). PCR was performed in a thermocycler (PTC200, MJ Research, Waltham, Mass.) (1.95° C., 2 min; 2.95° C. 30 s; 3.50° C., 10 s; repeat steps 2 and 3 100×).

Screening for Hydroxylation Activity Cultivation and Expression of P450 BM-3 Mutant Libraries

[0106] For the first three rounds of evolution, a robot (Qpix, Genetix) picked and inoculated colonies into 1 ml deep-well plates containing LB media (400 ml) and ampicillin (100 mg/ml). The plates were incubated at 37° C., 270 RPM, and 80% relative humidity. After 24 hours, the culture liquid (50 ml) was added to TB (450 ml) containing, ampicillin (100 mg ampicillin/ml), thiamine (5 mg/ml), and trace elements (0.25 ml/ml). After growth at 37° C. for 1 hour, -aminolevulinic acid hydrochloride (1 mM) and isopropyl—thiogalactopyranoside (1 mM) was added. The temperature was shifted to 30° C. and the cultures were grown for 24 hours.

[0107] For the 4^(th) and 5^(th) generations the screening procedure was modified to the following procedure. The plates with the picked colonies were incubated in LB (containing 100 μg/L ampicillin) at 30° C., 270 RPM, and 80% relative humidity. After 24 hours, TB (500 ml) containing, ampicillin (100 mg ampicillin/ml), thiamine (5 mg/ml), and trace elements (0.25 ml/ml), δ-aminolevulinic acid hydrochloride (1 mM) and 10 μM isopropyl-thiogalactopyranoside, was inoculated with the preculture using a 96 inoculation pin and grown for 24 hours.

Preparation of Cell Lysates

[0108] For the first three generations, the plates were centrifuged and supernatants were discarded. Cell pellets were washed with Tris-HCl (350 ml, pH 8.3), frozen at −20° C. for at least 8 hours and then resuspended in 400 μl Tris-HCl (350 ml, pH 8.3) containing lysozyme (0.5 mg/ml), deoxyribonuclease I (0.1 mg/ml) and MgCl₂ (10 mM). After incubation at 37° C. for 45 minutes, the plates were centrifuged and the lysate (150 ml) was transferred to a 96-well plate. For the final generations, the frozen cell pellets were resuspended in phosphate buffer (1 mL, 0.1 M, pH 8.0) containing lysozyme (0.5 mg/mL0, DnaseI (0.1 μL/mL) and MgCl₂ (10 mM). The lysates were centrifuged and the supernatants were diluted for activity measurements in 96 well microtiter plates.

High Throughput Determination of Enzymatic Activity

[0109] For the first three generations, 8-pnpane (150 μM) in DMSO (1%) was added to the lysate and incubated at room temperature. After 5 minutes, NADPH (1 mM) was added and the absorbance at 410 nm was measured with a microplate spectrophotometer (SPECTRAmax, Molecular Devices).

[0110] For the final generations, mutant libraries were screened using 8-pnpane as described above. Also, a cofactor (NADPH) depletion assay was used to determine the turnover rates. The lysates were diluted into 96 well microtiter plates containing phosphate buffer (200 μL, 0.1 M, pH 8.0), alkane substrate (0.5-1.0 mM), and DMSO (1%). The liquid alkanes were added to the buffer using alkane stock solutions in DMSO, whereas gaseous alkanes were bubbled into buffer for ˜45 minutes to obtain saturated solutions. The reaction was initiated by addition of NADPH (200 μM), and the oxidation of NADPH was monitored at 340 nm. Only the mutants active in both screens were isolated and recharacterized.

Rescreening

[0111] The most active clones from the primary screen were streaked out on agar plates to get single colonies. Four to 8 single colonies were recultured in deep well plates and rescreened as described above. In the rescreen all clones were also assayed for hydroxylation of the target substrate octane. The same dilutions of lysates in buffer were used as in the 8-pnpane assay. A stock octane substrate solution (225 μM) in DMSO (1%) was added to-the lysates. After addition of NADPH (0.75 mM), in the same buffer as used for the lysates, the oxidation of NADPH to NADP⁺ was followed using the microplate spectrophotometer for 3 min at 340 nm. As a control the same assays were performed without addition of the alkane substrate to verify that NADPH consumption was coupled to the presence of substrate.

[0112] For propane oxidation, the cells were already lysed in a buffer, where the 0.1 M Tris-HCl from the lysis buffer described above was replaced by 30 mM phosphate buffer pH 7.4 to avoid organic substances (which could conceivably become substrates for the enzyme) in the buffer. Phosphate buffer was saturated with propane by bubbling propane into the buffer for 1 hour. Thirty μl of the bacterial lysates were pipeted into 96 well microtiter plates and 120 μl of the propane saturated buffer were added. The reaction was started again by addition of 50 μl of 3 mM NADPH and followed at 340 nm for 3 min.

Agar Plate Colony Development

[0113] Cells were grown on LB agar plates containing 100 μg/ml ampicillin, 1 mM aminolevulinic acid hydrochloride and 10 μM isopropyl thiogalactopyranoside. The latter two substances are not necessary if the activity resulting from a leaky promoter system is high enough. A substrate solution was prepared containing 30 mM phosphate buffer pH 7.4, 100 μM polymyxin B sulfate as a cell permeabilizer, 2 mM NADPH and 5 mM alkane substrate. The substrate solution was sonicated before use to emulsify most of the substrate in the buffer system.

[0114] A nitrocellulose membrane was soaked with this substrate solution and placed on the colonies on the agar plate. The optimal reaction time was estimated by testing the assay on control agar plates with colonies of clones with different hydroxylation activity before starting the main screen. After the reaction time the color reagent (0.5 mg/ml NBT in 30 mM phosphate buffer pH 7.4 and some crystals of the catalyst PMS) was pipeted directly onto the nitrocellulose membrane. Colonies located under white spots on the membrane were picked with a toothpick and streaked out on fresh agar plates to get single colonies for the re-screen.

Determination of the Maximum Initial Rates for Hydroxylation

[0115] The enzymes were purified and quantified as described above. First, the substrate concentration corresponding to the maximum turnover rate was determined by monitoring NADPH consumption with a plate reader in the presence of enzyme in phosphate buffer (0.1 M pH 8.0) and varying amounts of substrate in methanol (1%). After identifying the concentration of substrate that coincides with the maximum rate, the rate was measured using an UV-Vis spectrophotometer and 1 cm path length quartz cuvettes. A typical reaction solution contained enzyme (700 μL, 0.35-3.5 μM) in potassium phosphate buffer (0.1 M, pH 8.0) and substrate in methanol (1%). The reaction was initiated by the addition of NADPH (300 μL, 200 μM), and the absorption at 340 nm was monitored.

[0116] The amount of H₂O₂ was determined using 2,2′-azino-di-[3-ethyl-benzothiazidine-6-sulfonic acid/horseradish peroxidase assay by following published procedures (Yeom, H. & Sligar, S. G. Oxygen activation by cytochrome P450BM-3: effects of mutating an active site acidic residue. Arch Biochem Biophys 337, 209-216. (1997).

B. Wild-Type P450 BM-3 Alkane Oxidation Assay Octane Oxidation by P450BM-3

[0117] The enzymes were purified and quantified as described above. The oxidation was performed with solutions containing octane in DMSO (1 mM octane; 1% DMSO), P450 BM-3 (2-3 μM), and NADPH (1-5 mM) in Tris-HCl (50 mM) containing NaCl (340 mM). A concentrated solution of octane in DMSO was added to the enzyme. The resulting solution was incubated for 30 minutes at room temperature. Octane oxidation was initiated by the addition of NADPH in aqueous solution. After a specific time, the solution was extracted three times with of CH₂Cl₂ (333 mL) containing decanol (1 mM) as an internal standard. The organic layer was dried over anhydrous Na2SO₄, and the products were analyzed by GC/MS.

[0118] Catalytic activity of P450 BM-3 was measured spectrophotometrically by monitoring the rate of NADPH oxidation, as described (Matson et al. (1977). The assay solution contained 0.1 nmole P450 BM-3, octane in DMSO, and 0.8 mM NADPH, 50 mM NaCl in 0.1 M Tris-HCl, pH 8.2.

[0119] Production of H₂O₂ during the hydroxylation reaction was determined using the iron(III) thiocyanate assay (Fruetel et al., 1994). 1.0×10⁻⁹ mole P450 BM-3 was incubated with 1.0×10⁻⁶ mole octane for 5 minutes. The reaction was initiated by addition of 2.5×10⁻⁷ mole NADPH to the enzyme solution. Every 2 minutes a 0.2 ml aliquot from the reaction was added to 1.0 ml aqueous iron(II) solution (5.0 grams FeSO₄(NH₄)₂×6H₂O, 45.0 ml degassed H₂O, 5 ml concentrated H₂SO₄). Subsequently, 0.4 ml of a 10% aqueous solution of KSCN was added to the solution, and the absorbance was measured at 480 nm.

C. Assay for IX139-3 Catalytic Activity with Alkanes Determination of the Maximum Initial Rates for Liquid Alkane Hydroxylation by IX139-3

[0120] The enzymes were purified and quantified as described above. First, the substrate concentration corresponding to the maximum turnover rate was determined by monitoring NADPH consumption with a plate reader in the presence of enzyme in phosphate buffer (0.1 M pH 8.0) and varying amounts of substrate in methanol (1%). After identifying the concentration of substrate that coincides with the maximum rate, the rate was measured using an UV-Vis spectrophotometer and 1 cm path length quartz cuvettes. A typical reaction solution contained enzyme (700 μL, 0.35-3.5 μM) in potassium phosphate buffer (0.1 M, pH 8.0) and substrate in methanol (1%). The reaction was initiated by the addition of NADPH (300 μL, 200 μM), and the absorption at 340 nm was monitored. The substrates examined were pentane, hexane, cyclohexane, and octane.

[0121] The amount of H₂O₂ was determined using 2,2′-azino-di-[3-ethyl-benzothiazidine-6-sulfonic acid/horseradish peroxidase assay by following published procedures (Yeom & Sligar, 1997).

Determination of the Maximum Initial Rate for Gaseous Alkane Hydroxylation by IX IX139-3

[0122] Potassium phosphate buffer (0.1 M, pH 8.0) was saturated with the gaseous alkane (propane or butane) by bubbling the substrate into the solution for 1 hour. A typical reaction contained alkane saturated buffer (700 μL) and enzyme (0.5 μM). Addition of NADPH (300 μL, 200 μM) in buffer initiated the reaction, and the rate of NADPH oxidation was follow at 340 nm.

GC/MS Analysis

[0123] Biocatalytic oxidations were performed under oxygen limited conditions in sealed vials. For octane, hexane, or cyclohexane conversions, the solution contained alkane (1 mM) in DMSO (1% DMSO) and enzyme (1 μM) in potassium phosphate buffer (100 mM, pH 8.0). The solution was stirred at room temperature for 5 minutes, and the reaction was initiated by the addition of NADPH (1 mM).

[0124] Reactions with gaseous alkanes were carried out in a sealed 20 mL vial containing enzyme (1.0 μM) in potassium phosphate buffer (5 mL, 100 mM, pH 8.0). The headspace was filled with either propane or butane. The reaction was initiated with the addition of NADPH (1 mM). The reaction mixture was analyzed directly by GC/MS using an Hewlett Packard 5890 Series II gas chromatograph coupled with an Hewlett Packard 5972 Series Mass Selective Detector. The GC was fitted with HP FFAP column (crosslinked FFAP, 30 m×0.25 mm×0.25 mm film thickness). The condition for octane is as follows: Isothermic at 120° C. for 6 minutes. The condition for hexane and cyclohexane: (1) 100° C. for 5 minutes to 50° C. (2) 100° C. to 200° C. at 25° C./min. (3) Isothermic at 200° C. for 2 minutes. The condition for propane and butane: (1) 30° C. for 3 minutes. (2) 30° C. to 150° C. at 20° C./min. Authentic standards were used to identify the products by retention time. Products were further verified by matching the fragmentation distributions with a database in the software provided with the instrument manufacturer.

[0125] Initially for octane oxidation by wildtype, the products were identified with GC/MS using a Hewlett Packard 5890 Series II gas chromatograph coupled with a Hewlett Packard 5989A mass spectrometer. The GC was fitted with an HP 1 column (crosslinked methyl silicone gum, 12 m×0.2 mm×0.33 mm). The temperature gradient is as follows: 1) 40 to 50° C. at 15° C. at ,2) 50 to 75° C. at 10° C./min, 3) 75 to 160° C./min. Authentic standards were used to identify the retention times of the products. The products were further verified by matching the fragmentation distributions with a database in the software provided by the instrument manufacturer.

D. Results Wildtype P450 BM-3 is Active Towards Octane

[0126] In the presence of purified P450 BM-3 and NADPH, octane was consumed within 2 hr and gave several products detectable by GC/MS (FIG. 1). The total yield of product detected was about 50%, with alcohols accounting for about 90% of the product and ketones representing 10%. The major hydroxylated products were 4-octanol, 3-octanol, and 2-octanol. 1-Octanol was not detected under the experimental conditions applied. The product ratio was, approximately, 8:9:1; 4-octanol: 3-octanol: 2-octanol. 4-Octanone and 3-octanone were also present in the product mixture. A possible mechanism for the formation of the ketones is hydroxylation of the alcohol to generate a gem-diol which dehydrates to the corresponding ketone (Boddupalli et al., 1992; March, 1992a). Another possible mechanism is via the pinacol rearrangement (March, 1992b). However, it is also possible that traces of protein impurities were responsible for this oxidation, and further tests were therefore performed to validate the results.

[0127] It was tested whether P450 BM-3 catalyzes the oxidation of 3-octanol to 3-octanone. Upon addition of NADPH to a solution of P450 BM-3 and 3-octanol, a peak at 3.7 min appeared. This peak had the same retention time as an authentic sample of 3-octanone. Furthermore, the fragmentation pattern of the peak matched that of 3-octanone found in the mass spectrum database. Solutions containing only P450 BM-3 and 3-octanol, but no NADPH, do not produce any detectable 3-octanone. Similar results were obtained with 4-octanol as the substrate.

[0128] The activity of P450 BM-3 towards octane was measured spectrophotometrically by monitoring the rate of NADPH consumption. The following results were obtained: k_(cat)=0.7 s⁻¹, K_(m)=2.0×10⁻⁵ M and k_(cat)/K_(m)=3.5×10⁴ M⁻¹s⁻¹. Thus, on octane, the P450 BM-3 enzyme has a kcat 120-fold less and a Km that is 15 times larger (kcat/Km˜2000 times less) than on its preferred C16 fatty acid substrate.

P450 BM-3 Catalyzes Hydroxylation of Octane Without Uncoupling

[0129] Hydrogen peroxide production during catalysis by P450 BM-3 was also monitored in order to determine whether the rate of NADPH oxidation was affected by uncoupling (Oliver et al., 1997; Fruetel et al., 1994). H₂O₂ was not detected under the experimental conditions, which indicates that the P450 catalyzes the hydroxylation of octane without significant uncoupling. This was consistent with previous reports that indicated efficient coupling for P450 BM-3 acting on unnatural substrates such as styrene or alkyl trimethylammonium compounds (Oliver et al., 1997; Fruetel et al., 1994).

Directed Evolution Improves the Activity of Cytochrome P450 BM-3 Towards Octane and Other Substrates

[0130] Mutant library construction was focused on the P450 BM-3 heme domain, which contains the substrate binding site and the monooxygenase activity. The libraries were generated by error prone PCR (Zhao et al., 1999), using MnCl₂ concentrations of 0.0, 0.05, and 0.1 mM, which yielded 70, 60, and 50% active transformants, respectively. The library generated with no added MnCl₂, corresponding to about 2 base changes per gene (0.15% error rate), was chosen for screening (Zhao et al., 1999).

[0131] Two thousand clones of the first generation library were screened and 23 of the most active clones were selected for further analysis. Since the screen was sensitive to total activity, which includes increased expression as well as changes in specific activity, enzyme concentrations were estimated from the CO difference spectra for binding to the reduced P450 BM-3 in order to calculate specific activities (Omura and Sato, 1964). The best variant, VIII118 2C9, displayed approximately twice the specific activity of the wildtype enzyme (FIG. 4).

[0132] The 2^(nd) generation library was also created with error-prone PCR using VIII118 2C9 as the template, and 9 mutants were isolated that were between 1.8 and 2.5 times more active than VIII118 2C9. IX18_(—)18 was found to be 2.38 times more active than VIII118 2C9, and it was used for comparison for the 3^(rd) generation library.

[0133] The 3^(rd) generation was produced by StEP recombination (Zhao et al., 1999), using 9 variants from the 2^(nd) generation, and the most active mutant (IX35_(—)4) was found to be 1.74 times more active than IX118_(—)18. IX35_(—)4 was used for comparison for the 4^(th) generation.

[0134] The 4^(th) generation was constructed by error-prone PCR using the IX35_(—)4 as the template. The GeneMorph PCR Mutagenesis Kit (Stratagene) was used to create the libraries. The most active variant (IX 79_(—)1) was 1.35 times as active compared to IX35_(—)4. IX79_(—)1 was used for comparison for the 5^(th) generation library.

[0135] After some generations the evolved P450 BM-3 variants began showing high cytotoxicity. This caused large differences between the activities measured in the high throughput screen and then again after normalization to CO binding. As a result, the improvements in activity became smaller in generation 4, which might indicate that there is a “plateau”, i.e., a rather low limit to the activity which is possible for a P450 BM-3 variant created by this approach alone.

[0136] Even in the case of the strongly regulated promoter used in this study, there is always some leakiness of the promoter. Even small amounts of toxic proteins produced by the leaky promoter could cause large (negative) physiological effects on the different clones during growth in the 96 deep well plates. Therefore, the screening procedure was redesigned after generation four, as described under “Materials and Methods.”

[0137] The 5^(th) generation was produced using IX79_(—)1 as the template. Again, Genemorph was used to create the libraries. Using a modified screening procedure in generation five resulted in highly improved variants for alkane hydroxylation. The best three clones are between 1.24-1.86 times more active than IX79_(—)1. The best clone from the 5^(th) generation (IXI39-3) was 23 times more active than wildtype. The same library of the 5^(th) generation was used for a second screen using mutant IX139-3 for comparison.

[0138] Selected clones, including the three best clones from the first screen (IX139-3, IX139-37 and IX139-43), were recombined by StEP to produce the library of the 6^(th) generation. Several hundred clones of this new library have been analyzed using the microtiter plate assays, and results indicate further improvement of the alkane hydroxylating activity. For example, the mutant “J” (V78A, T175I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, and L353V) was shown to be 2.0 and 1.6 fold more active for NADPH consumption for propane and butane, respectively.

[0139] All improved clones found in the primary screen were rescreened for NADPH consumption by hydroxylation of octane. Only clones showing improved activity in presence of octane and a low background without octane were selected as templates for new generations. This gave some certainty that this was the right track for optimizing alkane hydroxylation activity. An assay for propane hydroxylation was included in the screening program in the sixth generation. The mutations, and the amino acid substitutions they lead to in the evolved P450 BM-3 enzymes, are listed below (Table 4). TABLE 4 Selected Cytochrome P450 Mutants Created By Directed Evolution Mutations were verified by sequencing one strand of DNA. Amino Acid Generation Clone Mutation Codon Mutation Wild-type viii18_6 1st Gen., ep PCR, taq viii118 2C9 His236Gln CAT→CAG 2nd Gen., ep PCR, taq viii159 7A4 Met30Ile ATG→ATT Asp232Gly GAT→GGT His236Gln CAT→CAG Met416Leu ATG→TTG viii159 7G4 Glu64Ala GAA→GCA Ile220Thr ATT→ACT His236Gln CAT→CAG Thr411Ala ACG→GCG viii159 10A6 Val78Ala GTA→GCA Phe162Ser TTT→TCT Lys224Ile AAA→ATA His236Gln CAT→CAG Lys306Glu AAA→GAA viii159 10A11 Thr10? ACG→NCG Glu13? GAG→GNG Met118Leu ATG→TTG Gly154Gly* GGT→GGA His236Gln CAT→CAG viii159 12A5 His236Gln CAT→CAG Ile258Thr ATT→ACT ix18_12 Pro45Pro* CCT→CC(T/C) His171Gln CAT→CAG His236Gln CAT→CAG Asp370Glu GAT→GAA ix18_18 Gln73Gln* CAA→CAG His236Gln CAT→CAG Ile259Val ATT→GTT Leu272Leu* CTT→CTC Lys289Lys* AAA→AAG Glu380Gly GAA→GGA ix18_34A Lys187Glu AAG→GAG His236Gln CAT→CAG ix18_38 Lys59Lys* AAA→AAG Lys97Lys* AAA→AAG His236Gln CAT→CAG Glu252Gly GAG→GGG Lys289Lys* AAA→AAG 3rd gen, step ix35_4 Val78Ala GTA→GCA His236Gln CAT→CAG Glu252Gly GAG→GGG 4th gen, genemorph ix79_1 Val78Ala GTA→GCA Phe107Phe* TTC→TTT Thr175Ile ACA→ATA Ala184Val GCA→GTA His236Gln CAT→CAG Glu252Gly GAG→GGG Ala290Val GCA→GTA Leu353Val CTA→GTA 5th gen, genmorph ix139-3 Val78Ala GTA→GCA Phe107Phe* TTC→TTT His138Tyr CAT→TAT Thr175Ile ACA→ATA Val178Ile GTC→ATC Ala184Val GCA→GTA His236Gln CAT→CAG Glu252Gly GAG→GGG Arg255Ser CGC→AGC Ala290Val GCA→GTA Ala295Thr GCA→ACA Leu353Val CTA→GTA Gln397Gln* CAG→CAA ix139-37 Val78Ala GTA→GCA Phe107Phe* TTC→TTT Asn159Asn* AAC→AAT Thr175Ile ACA→ATA Ala184Val GCA→GTA Asn186Asp AAC→GAC Arg203Arg* CGC→CGT Asp217Val GAT→GTT His236Gln CAT→CAG Glu252Gly GAG→GGG Ala290Val GCA→GTA Leu353Val CTA→GTA Gly396Met GGT→AGT Thr427Thr* ACA→ACT ix139_43 Glu4Glu* GAA→GAG Val78Ala GTA→GCA Phe107Phe* TTC→TTT Thr175Ile ACA→ATA Ala184Val GCA→GTA Ser226Ile AGC→ATC His236Gln CAT→CAG Glu252Gly GAG→GGG His266His* CAC→CAT Ala290Val GCA→GTA Leu353Val CTA→GTA

Evolved P450 BM-3 Variants Hydroxylate Various Substrates

[0140] The chosen screen was sensitive to hydroxylation of the methylene adjacent to the oxygen atom of the surrogate the p-nitrophenoxyoctane substrate. Activity towards octane, hexane, and cyclohexane was therefore measured, by measuring the rate of NADPH oxidation, which was assumed to be fully coupled to alkane oxidation. As a result, relative activities determined using NADPH oxidation were assumed to equal the relative activities on the different substrates.

[0141] Compared to wildtype, for octane oxidation, the k_(cat) for IX139-3 improved 50 times, while the K_(m) increased 45-fold (FIG. 5). The wildtype kinetic values for hexane and cyclohexane were of the same magnitude as for octane.

[0142] Results showing that hexane and cyclohexane were oxidized by IX79_(—)1 were verified with GC/MS. Samples containing the cell lysate of IX79_(—)1, hexane, and NADPH produced 2- and 3-hexanol (FIG. 6). 1-Hexanol was not detected. No products were found with IX79_(—)1 and hexane alone, as a control.

[0143] IX79_(—)1 in the presence of cyclohexane and NADPH produced cyclohexanol, and cyclohexanone was not detected (FIG. 7). Again, no products were detected with IX79_(—)1 and cyclohexane alone.

[0144] Results also showed that IX139-3 was able to oxidize propane and butane, as verified with GC/MS. The reaction mixture contained enzyme, propane or butane, and in the presence of NADPH produced 2-propanol and 2-butanol (FIG. 8), respectively. Terminal hydroxylation was not detected under these reaction conditions.

[0145] Since the activity towards medium chain alkanes had already reached a significant level in generation five and IX139-3 displayed detectable activity for propane oxidation, it was decided to include propane into the screening program in the sixth generation. NADPH consumption assays of the sixth generation showed that mutant IX139-3 and some new variants also oxidized propane.

Agar Plate Colony Assay for Screening Mutant Libraries

[0146] This assay was based on the formation of the purple dye formazan upon reaction of NADPH with Nitro Blue Tetrazolium (NBT) salt in presence of the catalyst, phenazine methosulfate (oxidized, PMS) This color reaction is known as a photometric standard assay for dehydrogenase activity resulting in reduction of their cofactor NADP+ to NADPH.

[0147] In the present investigations it was used to measure depletion of NADPH that accompanies the P450-catalyzed oxidation of substrate. Bacterial colonies which are locally using up the cofactor from filter paper soaked with NADPH and substrate for hydroxylation reactions remained white after reaction of the remaining NADPH in the filter by reaction with NBT. This screen resulted in white spots on a purple filter paper caused by bacterial colonies consuming NADPH. The screen will be used for prescreening libraries of future generations. Positive clones could be verified with the other assay systems that look directly at oxidized product formation, since clones showing strongly uncoupled NADPH consumption, and no substrate oxidation, might appear as improved hydroxylation variants (false positives).

Example 2 Directed Evolution of a Cytochrome P450 Monooxygenase for Organic Solvent Resistance

[0148] The total activity of P450 BM-3 in 96-well plates is-relatively low, especially in the presence of an organic solvent that further reduces the fraction of active enzymes. The mutant F87A converts the 12-pNCA substrate 4-5fold faster than the wild-type, the K_(m) value is 1.5-fold lower, and the chromophore from 2-pNCA substrate is released completely and not only to 33% (Farinas, 2001; Schwaneberg, 1999a). Therefore, the F87A mutant and not the wildtype was used as a starting point of the in vitro directed evolution. The evolutionary experiment to discover more organic solvent resistant variants was performed under very restrictive conditions in order to preserve the valuable properties of the parents, a for monooxygenases remarkably high total activity and thermostability. The thermostability was under selective pressure by using the temperature inducible PRPL-promoter system, and only clones that showed a high activity and a high organic solvent resistance were used as parents for further generations.

Experimental

[0149] All chemicals were of analytical reagent grade or higher quality and were purchased from Fluka, Sigma or Aldrich. THF (Aldrich, 99.9%) and DSMO (Mallinckrodt AR, 99.9%) were of highest available purity grade. Enyzmes were purchased from New England Biolabs, Stratagene, and Boehringer Mannheim.

Cultivation and expression in 96-Well Plates

[0150] The P450 BM-3 and P450 BM-3 F87A genes are under the control of the strong temperature inducible PRPL-promoter. Mutated BM-3 F87A variants were cloned into the pUSCI BM-3 vector by using BamHI//EcoRI or Age//EcoR1 restriction sites. Transformed clones grown on LBamp plates (Genetix) were transferred via a colony picker (QPix; Genetix) into 96 well plates (flat bottom; Rainin) containing 120 μl LB culture supplemented with 12 μg ampicillin per well. After growth for 12 hours at 37° C. in a shaked incubator (280 rpm) 3 μl of each culture was transferred with a grooved 96-pin deep-well replicator tool (V&P-Scientific) into 2 ml deep-well plates (Becton Dickinson) containing 400-500 μl of enriched TBamp medium. TBamp solution was supplemented with 75 μl trace element solution (0.5 g CaCl₂×2H₂O, 0.18 g ZnSO₄×7H₂O, 0.10 g MnSO₄, H₂O, 20.1 g Na₂-EDTA, 16.7 g FeCl₃×6H₂O, 0.16 g CuSO₄×5H₂O, 0.18 g CoCl₂×6H₂O, add 1L H₂O and autoclave) and 2 mg aminolaevulinic acid per 50 ml TB. E. coli cells were grown in this medium for 6 h at 37° C. then induced for 14 h at 42° C. All deep-well plates were covered with a taped lid. The original LBamp plates were stored until further use at 80° C. after adding 100 μl glycerol (sterile, 50% (v/v)).

Screening Procedures

[0151] All experiments were performed in organic solvent resistant polypropylene flat bottom 96 well plates (Greiner Bio-one). From each plate a blank pre-reading was recorded prior performing assay procedures.

[0152] Fast prescreen procedure. Deep well cultures were mixed well by a liquid handling machine (Multimek 96; Beckman) and 90 μl cell culture was transferred to each well of the reference and assay plate. To each well 40 μl Tris/HCl buffer (25 mM; containing 200 μM polymyxin B) and 5 μl 12-pNCA (15 mM, dissolved in DMSO) were pipetted using the Multimek. In addition 4.5 μl THF or 45 μl DMSO were transferred to each well of the assay plate. After an incubation time of 12 min the 12-pNCA conversion is initiated by adding 20 μl of a Isocitric co-factor regeneration solution (Isocitric acid 20 mM; dH₂O, NADP⁺ 3 mM, Isocitric dehydrogenase 0.8 U/ml). The reaction was stopped after visible color development by the addition of 50 μl NaOH (1.5 M). After 8-12 h incubation and removal of the bubbles using the Bunsen burner the absorption at 410 nm of the clear solution was recorded. The P450 BM-3 variants revealing a high activity and a high organic solvent resistance were used for rescreening.

[0153] Rescreen procedure. Deep well cultures were in contrast to the prescreening method centrifuged at 4000 rpm for 10-20 min to remove the brownish TB media. The cell pellets were frozen overnight at 20° C. and resuspended in 200 μl lysomix (pH 7.5, 25 mM Tris/HCl or 25 mM K_(x)PO₄ supplemented with 1-50 mg lysozyme (Sigma) per 100 ml). 90 μl cell suspension per well was transferred to the reference plate and the assay plate. To lyse the E. coli cells the plates were incubated at 37° C. for 1 h. To each well 30 μl Tris/HCl buffer (25 mM) and 5 μl 12-pNCA (15 mM, dissolved in DMSO) were pipetted using the Multimek96. 15 μl THF solution (15% (v/v); ddH₂O) were additionally added to each well of the assay plate. After an incubation time of 12 min the 12-pNCA conversion is initiated by adding 20 μl of a Isocitric co-factor regeneration solution (Isocitric acid 20 mM; dH₂O, NADP+ 3 mM, Isocitric dehydrogenase 0.8 U/ml). The reaction was stopped after visible color development by the addition of 100 μl UT-buster (NaOH 1.5 M, 1.5 M Urea, 50% (v/v) DMSO). After 8-12 h incubation and removal of the bubbles using the Bunsen burner the absorption at 410 mn of the clear solution was recorded. The P450 BM-3 variants revealing a high activity and a high organic solvent resistance were cultured and expressed in shaking flasks for further characterization.

Shaking Flask Cultures and Purification

[0154] Fifty and 500 ml cultures were inoculated with a 1:100 dilution of an overnight Luria-Bertani (LB) culture of recombinant E. coli DH5 containing the pT-USC1 BM-3 variant. The cells were shaken at 300 rpm at 37° C. At an OD578=0.8-1 the cells were induced by increasing the temperature to 42° C. After 8 h, the cells were harvested by centrifugation at 4-8° C. The cell pellet was resuspended in Tris-HCl (15 ml, 0.1 M, pH 7.8) and lysed by sonication (3×2 minutes; output control=5, duty cycle 40%; Sonicator, Heat Systems—Ultrasonic, Inc.). The lysate was centrifuged at 23,300 g for 30 min. The supernatant was further cleared through a low protein binding filter (0.45 μM). The filtrate was loaded on a SuperQ650M anion exchanger column (TosoHaas) and purified as previously described (Schwaneberg, 1999b).

Photometric Enzyme Assays

[0155] All photometric assays were carried out under aerobic conditions. UV/vis measurements were performed in a Shimadzu spectrophotometer (BioSpec-1601). P450 BM-3 F87A concentrations were measured by CO-difference spectra, as reported by Omura and Sato using ε=91 mM⁻¹cm⁻¹ (Omura and Sato, 1964). Conversion of the p-nitrophenoxydodecanoic acid (12-pNCA) was monitored at 410 nm using a ThermomaxPlus plate reader (Molecular Devices) and an ε=13,200 M⁻¹cm⁻¹ (Schwaneberg et al., 1999a). The principle of the p-nitrophenoxycarboxylic acid (pNCA) assay system is described in FIG. 9. ω-Hydroxylation of pNCA by P450 BM-3 leads to an unstable hemiacetal intermediate, which spontaneously dissociates into the ω-oxycarboxylic acid and the yellow chromophore p-nitrophenolate. This assay system allows a continuous photometric detection of the P450 BM-3 activity, as measured by the maximum turnover rate, i.e., the number of product molecules generated per minute(Schwaneberg et al., 1999a). TABLE 5 PCR Primers. “N” means that any nucleotide (A, T, G, or C) can be used. SEQ ID Designation Sequence (5′->3′) NO: For error-prone PCR pTBamHd GAA CCG GAT CCA TGA CAA TTA AAG AAA 13 TGC Rev3250 CTA TTC TCA CTC CGC TGA AAC TGT TG 14 For saturation mutagenesis at hot positions: pT235_F GCG ATG ATT TAT TAN NNC ATA TGC TAA 15 ACG GA pT235_R TCC GTT TAG CAT ATG NNN TAA TAA ATC 16 ATC GC pT471_F CAG TCT GCT AAA AAA GTA NNN AAA AAG 17 GCA GAA AAC GC pT471_R GCG TTT TCT GCC TTT TTN NNT ACT TTT 18 TTA GCA GAC TG pT1024_F GAC GTT CAC CAA GTG NNN GAA GCA GAC 19 GCT CGC pT3074_R GCG AGC GTC TGC TTC NNN CAC TTG GTG 20 AAC GTC For back-mutation of F87 position: A87F1 GCA GGA GAC GGG TTA TTT ACA AGC TGG 21 ACG A87F2 CGT CCA GCT TGT AAA TAA CCC GTC TCC 22 TGC F87G1y1 GCA GGA GAC GGG TTA GGT CAA GCT GGA 23 CG F87G1y2 CGT CCA GCT TGT ACC TAA CCC GTC TCC 24 TGC F87Trp1 GCA GGA GAC GGG TTA TGG ACA AGC TGG 25 ACG F871rp2 CGT CCA GCT TGT CCA TAA CCC GTC TCC 26 TGC F87His1 GCA GGA GAC GGG TTA CAC ACA AGC TGG 27 ACG F87His2 CGT CCA GCT TGT GTG TAA CCC GTC CTC 28 CTG C F87Asn1 GCA GGA GAC GGG TTA AAC ACA AGC TGG 29 ACG F87Asn2 CGT CCA GCT TGT GTT TAA CCC GTC TCC 30 TGC F87Asp1 GCA GGA GAC GGG TTA GAT ACA AGC TGG 31 ACG F87Asp2 CGT CCA GCT GTA TCT AAC CCG TCT CCT 32 GC F87Arg1 GCA GGA GAC GGG TTA CGT ACA AGC TGG 33 ACG F87Arg2 CGT CCA GCT TGT ACG TAA CCC GTC TCC 34 TGC F87Va11 GCA GGA GAC GOG TTA GTT ACA AGC TGG 35 ACG F87Va12 CGT CCA GCT TGT AAC TAA CCC GTC TCC 36 TGC F8711e1 GCA GGA GAC GGG TTA ATT ACA AGC TGG 37 ACG F8711e2 CGT CCA GCT TGT AAT TAA CCC GTC TCC 38 TGC F87Lys1 GCA GGA GAC GGG TTA AAA ACA AGC TGG 39 ACG F87Lys2 CGT CCA GCT TGT TTT TAA CCC GTC TCC 40 TGC

[0156] Mutgenesis conditions For all the PCR reactions the thermocycler PTC 200 (MJ Reseach) was employed. Component Volume (μl) Protocol 1: First mutant generation. ddH2O 42.5X-Y Buffer (10X)  5 dNTP (10 mM)  1 pT_USC1 BM-3  1 (of a mini-prep) RO X (27 pmol) Rev3250 Y (27 pmol) MnCl₂  0.04 mM Taq polymerase  5 U Total volume 50 PCR program: 94° C. for 4 min 94° C. for 1:10 min; 55° C. for 1:30 min 72° C. for 4 min (30 cycles) 72° C. for 10 min (1 cycle) Protocol 2 (Gene Morph kit): Second mutant generation ddH₂O 40.5X-Y Buffer 10X (Provided in kits)  5 dNTP mix (Provided in kits)  1 Plasmid  2.5 pT_BamHI X (20 pmol) Rev3250 Y (20 pmol) Mutazyme (Provided in kits)  1 Total volume 50 PCR program: 95° C. for 30 s (1 cycle) 95° C. for 30 s; 55° C. for 30 s; 72° C. for 3:30 min (30 cycles) 72° C. for 10 min (1 cycle) Protocol 3: Site directed and saturation mutagenesis. ddH2O 41X Y Pfu Buffer (10^(x)) (From Stratagene)  5 Plasmid (1:20 dilution)  2 dNTP mix (10 mM)  1 Forward primer X (17.5 pmol) Reverse primer Y (17.5 pmol) Pfu turbo (From Stratagene)  1 Total volume 50 PCR program: 94° C. for 4 min (1 cycle) 94° C. for 1:15 min; annealing 1:15 min; 68° C. for 16 min (20 cycles) 68° C. for 20 min (1 cycle) Annealing temperature: 55° C. for pT235, pT471 and pT102 60° C. for back-mutations at position 87

Mutagenesis and Results

[0157] The results of these experiments are shown in TABLES 6 and 7 and FIGS. 10-13, and discussed below. TABLE 6 Selected Cytochrome P450 Mutants Created From BM-3 F87A by Directed Evolution and other techniques. All mutations are relative to the wild-type cytochrome P450 BM-3 (SEQ ID NO: 2, and only non-silent mutations are shown. Generation/ Amino Acid Mutation Step Mutant Mutation Codon Mutation F87A F87A TTT→GCA 1st Generation F87AB5 F87A TTT→GCA T235A ACG→GCG S1024R AGT→AGA F87APEC3 F87A TTT→GCA R471C CGC→TGC Saturation mutagenesis F87ASB3 F87A TTT→GCA R471A CGC→GCT T235A ACG→GCG S1024R AGT→AGA F87ABC1F10 F87A TTT→GCA R1024T AGA→ACG T235A ACG→GCG R471A CGC→GCT F87ABC1B6 F87A TTT→GCA R1024K AGA→AAA T235A ACG→GCG R471A CGC→GCT 2nd Generation F87A5F5 F87A TTT→GCA R471A CGC→GCT E494K GAA→AAA R1024E AGA→GAG T235A ACG→GCG Back-mutation Wd A87F GCA→TTT W5F5 R471A CGC→GCT E494K GAA→AAA R1024E AGA→GAG T235A ACG→GCG WSB3 R471A CGC→GCT T235A ACG→GCG S1024R AGT→AGA WB5 T235A ACG→GCG S1024R AGT→AGA WBC1F10 R1024T AGA→ACG T235A ACG→GCG R471A CGC→GCT Site-directed F87G F87G GCC→GGT mutagenesis F87A F87A GCC (starting point) F87V F87V GCC→GTT F87I F87I GCC→ATT F87W F87W GCC→TGG F87D F87D GCC→GAT F87N F87N GCC→AAC F87H F87H GCC→CAC F87K F87K GCC→AAA F87R F87R GCC→CGT

[0158] TABLE 7 Relative increase in 12-pNCA specific activity in the absence of additional co-solvents for 12-pNCA (“Absence of Co-Solvents”); and relative increases in total activity at 10% (v/v) DMSO (“10% DMSO”) or 2% (v/v) THF (“2% THF”). P450 BM-3 Variant Absence of Co-Solvents 10% DMSO 2% THF F87A 1.00 1.0 1.0 F87A5F5 3.40 5.5 10.0 F87ASB3 2.53 3.7 5.7 F87AB5 2.85 3.7 5.3 F87ABC1F10 2.87 4.4 7.9 Wd 1.00 1.0 1.0 W5F5 2.51 5.9 3.4 WSB3 1.46 3.1 1.7 WB5 1.03 1.8 1.0 WBC1F10 2.01 3.5 2.3

First Mutant Generation

[0159] Random mutations were introduced by PCR into the BM-3 F87A gene coding for 1049 amino acids and a His6 tag at the C-terminal end, under conditions designed to generate an average of one to two amino acid substitutions per gene (protocol 1). The mutant library was screened for clones with improved organic solvent resistance by comparing the, activity in the presence and absence of a co-solvent. Approximately 6,520 clones were tested in 96 well plates using the 12-pNCA assay in presence and absence of an organic solvent. The candidates with high total activity and high activity ratios (activity in presence divided by activity in absence of an organic solvent) were selected for re-screening (protocol 1). Positive results of these assays were verified after expressing 39 clones of the first generation in 500 ml scale. Many of these clones were expressed higher than F87A. After lysing the E. Coli cells and determining the P450 content, the organic solvent activities were measured using 1 ml cuvettes. The percentage of the relative activity in presence of organic solvent divided by the activity in absence of a co-solvent of these 39 clones is shown in FIG. 10. For more than 50% of the re-screened clones a superior organic solvent resistance was found. Interestingly, apart from false positives two types of clones were discovered a) Increased DMSO and reduced THF resistance, b) Increased DMSO and increased THF resistance. No clones with increased DMSO and reduced THF resistance were detected. For our purposes, the mutants with multiple organic solvents resistances are the most interesting ones. The in DMSO most resistant mutants F87AB5 and F87APEC3 were selected for detailed analysis. Sequencing revealed three non-silent mutations, two in F87AB5 (T(ACG)235A(GCG); S(AGT)1024R(AGA)) and one in F87APEC3 (R(CGC)471C(TGC)). Interestingly, position 471 was again mutated in another clone, a substitution to serine instead of cystein was found. Under the assumption of identical expression levels an increased total activity of 3.7-fold at 10% (v/v) DMSO and of 5.3-fold at 2% (v/v) THF was measured for F87AB5 (Table 7).

Saturation Mutagenesis (SM)

[0160] Only a limited set of amino acid (aa) substitutions can be explored by PCR mutagenesis at low error-rates and many as substitutions that require the exchange of two or more nucleotides will not be present in these PCR libraries. SM at sites identified by error-prone PCR allows exploring these as changes and can result in the discovery of superior catalysts. Therefore, SM was used to introduce all 20 as into position to the double mutant B3. Screening of about 576 clones revealed a more active triple mutant SB3. When expressed and purified in parallel with F87A, F87ASB3 revealed a up to 4 times higher expression level than F87A and the DMSO and THF resistance was further improved as shown in FIGS. 11A, 11B and by the organic solvent resistance profile in FIG. 12A. Sequencing revealed an exchange of R(CGC)→471A(GCT). The SM of the position A235 of clone SB3 resulted in no further improvements. All 5 sequenced clones contained an alanine at aa-position 235. SM at position 81024 revealed two more active clones, F87ABCIFIO and F87ABCIB6. F87ABCIF10 contains a R(AGA)1024T(ACG) substitution and BC1B6 a R(AGA)1024K(AAA) substitution. A detailed analysis of F87ABCIFIO revealed increased total activity of 4.4 fold at 10% (v/v) DMSO and of 7.9-fold at 2 (v/v) THF compared to the grandparent F87A (Table 7).

Second Mutant Generation (GeneMorph Kit)

[0161] In parallel to the saturation mutagenesis, random mutations were introduced into the F87ASB3 clone. The Taq-Polymerase used in the first round of error-prone PCR has a strong bias to transitions whereas the polymerase in the GeneMorph is biased toward transversions (Stratagene Inc.). This round of error-prone PCR should therefore result in a different set of mutation and as changes. After only screening about 1440 clones, the mutant F87A5F5 was found. Sequencing revealed two non-silent transversions at aa-positions E(GAA)494K(AAA) and R(AGA)1024E(GAG). The simultaneous exchange of three nucleotides to the complementary one at position 1024 seems to be very unlikely, however it has been confirmed by double sequencing this position in clone F87ASB3 and F87A5F5. F87ASF5 revealed a higher organic solvent resistance for DMSO and THF than any previous clones. For F87A5F5 a, compared to F87A, increased total activity of 5.5-fold at 10% (v/v) DMSO and of 10-fold at 2% (v/v) THF (Table 7) was discovered. The mutagenic pathways starting from the parent F87A are summarized in FIGS. 11A and 11B and the organic solvent resistance profiles in DMSO, THF, acetone, acetonitrile, DMF, and ethanol are shown in FIGS. 12 and 13. In particular, FIGS. 12A-G show that the evolved mutants, including F87A5F5, exhibit an increased resistance to these organic solvents.

Purification

[0162] F87A, F87ASB3 and the wild-type were simultaneously expressed and one by one purified (FIG. 12C). A comparison of the organic solvent resistance between lysed crude extracts and purified monoxygenase revealed very similar resistance toward DMSO (FIGS. 12C and 13A). For THF, the resistance of the purified enzyme was reduced between 3-9%. This reduction might be correlated to hydrophobic impurities in crude protein extracts and relatively small amounts of THF present in the reaction solution.

Back-Mutation of Position 87

[0163] The comparison of the organic solvent activity of the wild-type and the mutant F87A revealed a significant higher resistance of the wild-type enzyme in organic co-solvents (FIG. 12C). Therefore, the best clones of the laboratory evolution experiment (FIG. 3) were back-mutated at position 87 to the wild-type. FIGS. 12B and 13B show that the activity profile of the fraction of in organic solvent active clones is especially for DMSO shifted to higher co-solvent concentrations. For example, the back-mutated clone W5F5 has an increased total activity of 5.9-fold at 25% (v/v) DMSO and of 3.4 fold at 2 (v/v) THF compared to the wild-type (Table 7). However, improvements are generally for the wild-type mutants, especially in the case of THF, lower compared to the factors for F87A. Interestingly, 30% (v/v) DMSO seems to be a threshold value to trigger a reduced organic solvent resistance. However, the expression rates and activity of W5F5 are sufficient to use this mutant as a starting point for further directed evolution.

[0164] The results of the back-mutation encouraged us to investigate in detail the influence of position 87 toward the activity of P450 BM-3 in organic co-solvents. By site-directed mutagenesis as changes from F to G, A, V, I, F, W, D, N, H, K and R were introduced (protocol 3) and confirmed by sequence analysis. The as changes to A, F revealed a fast 12-pNCA conversion and to I, V, G a lower one. All other mutants showed no detectable activity for 12-pNCA. These results were confirmed by using coumarone as a substrate and the Gibbs assay to detect hydroxylated products. The only exception was a low activity of the F87H toward coumarone. Initial analysis of the active clones discovered a size-depending organic solvent resistance in the order F>I>A, G, V.

[0165] Evolution has generated a stunning variety of enzymes through mutation/recombination and natural selection. However, monooxygenases are not well suited for industrial application. The results reported herein proves that laboratory evolution offers a fast and elegant way to adapt these enzymes to our needs in biotechnology applications. The achieved improvements in organic co-solvents resistance will bring this exceptional class of enzymes a step closer to industrial applications.

Example 3 Investigations of Solvent Conditions and Substrates for P450 BM-3 Mutant IX139-3

[0166] Cytochrome P450 BM-3 from Bacillus megaterium (Bodupalli et al., 1990) (P450 BM-3), a medium-chain (C12-C18) fatty acid monooxygenase, has been converted into a highly efficient catalyst for the conversion of alkanes to alcohols (See Example 1). The evolved P450 BM-3 exhibits higher turnover rates than any reported biocatalyst for selective oxidation of hydrocarbons. Unlike naturally-occurring alkane hydroxylases, among the best known of which are the large, membrane-associated complexes of methane monooxygenase (MMO) and AlkB, the evolved enzyme is water soluble and does not require additional proteins for catalysis. The evolved alkane hydroxylase was found to have even higher activity on fatty acids, the presumed biological substrates for P450 BM-3, which was already one of the most efficient P450s known. A broad range of substrates that includes the gaseous alkane propane induces the low to high spin shift, which activates the enzyme. The first soluble catalyst for alkane hydroxylation at room temperature, this laboratory-evolved P450 opens new opportunities for clean, selective, hydrocarbon activation for chemical synthesis and bioremediation.

Materials and Methods Expression of P450 BM-3 Variants

[0167] See Example 1.A. Expression and purification of P450 BM-3 variants.

Mutagenic PCR and StEP Recombination

[0168] For the first two generations, mutagenic PCR of the heme domain was performed as described (Farinas et al., 2001), using the following primers together with Taq polymerase (Roche): Bamfor 5′-ACAGGATCCATCGATGCTTAGGAGGTCATATG- (SEQ ID NO:41) 3′ Sacrev 5′-GTGAAGGAATACCGCCAAG-3′ (SEQ ID NO:42)

[0169] The PCR product was cloned by replacing the BamHI/SacI fragment of p BM-3_(—)_WT18-6. Nine mutants from generation 2 showing at least 2-fold improved activity on 8-pnpane were recombined by staggered extension process (StEP) (see Example 1; Zhao et al., 1998) using the same primers and 10 seconds extension time. A variant with 3 mutations (V78A, H236Q, E252G) with at least 2-fold improvement in activity relative to the parents was isolated. The 4th and 5th generations were generated by error-prone PCR using the Genemorph kit (Stratagene) according to the manufacturer's protocol, using approximately 1 to 10 ng of template DNA. The most active mutant, IX139-3, was isolated from the 5th generation. Sequencing of the gene revealed 13 point mutations. Eleven lead to amino acid substitutions (V78A, F107F, H138Y, T175I, V178I, A184V, H236Q, E252G, R255S, A290V, A295T, L353V, Q397Q), and two were synonymous.

Preparation of Cell Lysates

[0170] For high throughput screening, clones from the first three generations were cultivated as described (See Example 1; and Farinas et al., 2001). For subsequent generations, colonies were picked and inoculated by a Qpix (Genetix) robot into Luria Bertani media (LB, 350 μL, 100 mg/L ampicillin) into 1 mL deep well plates. The plates were incubated at 30° C., 250 rpm, and 80% relative humidity. After 24 hours, clones from this pre-culture were inoculated using a 96 replicator pin into 2 ml deep well plates containing Terrific broth media (TB, 400 μL), ampicillin (100 mg/L), isopropy-B-D-thiogalactoside (IPTG, 10 μM), and ALA (0.5 mM). The clones were cultivated at 30° C. for 24-30 hours. Cell pellets were frozen at −20° C. and resuspended in phosphate buffer (1 mL, 0.1 M, pH 8.0) containing lysozyme (0.5 mg/mL), DNase I (0.1 μg/mL) and MgCl₂ (10 mM). After 60 min at 37° C., the lysates were centrifuged and the supernatant was diluted for activity measurements in 96 well microtiter plates.

High Throughput Determination of Enzymatic Activity

[0171] Mutant libraries were screened on 8-pnpane as described (See Example 1 and Farinas et al., 2001). A cofactor (NADPH) depletion assay was used to determine turnover rates. E. coli lysates of the mutants were diluted into 96 well microtiter plates containing phosphate buffer (200 μI, 0.1 M, pH 8.0), alkane substrate (0.5-1 mM), and DMSO (1%). The liquid alkanes were added to the buffer using alkane stock solutions in DMSO, whereas the gaseous alkanes were bubbled into buffer for about 45 min to obtain saturated solutions. The reaction was initiated by addition of NADPH (200 mM), and the oxidation of NADPH was monitored at 340 nm. A total of about 10,000 colonies were screened over 5 generations. The determination of the maximal turnover rate is described in Example 1.

Solid Phase Assay

[0172] A solid phase NADPH depletion assay was used preselection of the fifth-generation mutant library. Cells were grown on LB agar plates containing ampicillin (100 μg/ml), ALA (1 mM) and isopropyl-B-d-thiogalactopyranoside (10 μM).

[0173] The assay solution contained phosphate buffer (0.1 M, pH 8.0), polymyxin B sulfate (100 μM) as a cell permeabilizer, NADPH (2 mM), and substrate (5 mM) and was sonicated before use. A nitrocellulose membrane soaked with this substrate solution was placed directly onto the colonies on the agar plate. The sensitivity of the assay was regulated by the NADPH concentration. After the reaction (about 5-15 min), nitro blue tetrazolium (0.5 mg/ml) in phosphate buffer (0.1 M ,pH 8.0) and some crystals of phenazine methosulfate were pipeted directly onto the membrane. Active colonies, which deplete NADPH, were identified as white spots on the purple membrane. Positive colonies were picked with a toothpick and streaked out on fresh agar plates to obtain single colonies for rescreening.

GC/MS Analysis

[0174] See Example 1.

Substrate Binding

[0175] Dissociation constants for octane, hexane, and lauric acid were determined at 25° C. as described (Modi et al., 1995, hereby incorporated by reference in its entirety) from the change in absorption at 418 nm upon substrate binding. For the alkanes, an enzyme solution (3-5 mM) in buffer (0.1 M potassium phosphate pH 8.0) was titrated with a stock solution of alkane (octane: 2 mM in methanol; hexane: 10 mM in methanol). Methanol (1%) added to an enzyme solution does not induce a spin state shift. For laurate, the reaction solution contained enzyme (3-5 mM) and laurate (1 mM) in buffer (20 mM MOPS, 100 mM KCl, pH 7.4). Aliquots of the enzyme/substrate solution were removed and replaced with an equal volume of an enzyme solution.

Results

[0176] Table 8 shows the relative amounts of different products obtained for alkane oxidation, comparing wild-type cytochrome P450 BM-3 to the evolved mutant enzyme IX139-3. TABLE 8 Product distribution for alkane oxidation by wild-type P450 BM-3 and IX139-3 Substrate Product 139-3 (%) Wild-type (%) Octane 2-octanol 66 17 3-octanol 32 40 4-octanol 2 43 Hexane 2-hexanol 19 20 3-hexanol 81 80 Cyclohexane cyclohexanol 100 100 Butane 2-butanol 100 Not determined Propane 2-propanol 100 Not determined

[0177] The results show that laboratory evolution methods consisting of sequential rounds of random mutagenesis, in vitro recombination, and high throughput screening converted this highly efficient fatty acid monooxygenase into one that can hydroxylate hexane and other alkanes similarly well. In a preliminary study (Farinas et al., 2001), it was verified that P450 BM-3 showed very low activity towards octane (Munro et al., 1993). A calorimetric screen using p-nitrophenoxy octane (8-pnpane) as an alkane substrate analog identified more active clones. Unfortunately, a plateau in the enzyme's activity on alkanes was reached after a few rounds of evolution, and further screening yielded no new improvements.

[0178] When designing a screening strategy for identifying cytochrome P450 BM-3 mutants that catalyze the hydroxylation of alkanes, it was necessary to have an assay sensitive enough to observe improvements when the activities were still very low. The colorimetric assay on the surrogate substrate 8-pnpane nicely fulfilled this role. The risk to using a surrogate substrate such as 8-pnpane, however, is that the enzyme may become ‘addicted’ to that particular substrate. Activity towards the desired substrate may not be increasing in the same proportion (or not at all). By the third generation of mutagenesis and screening, the most active BM-3 variant acquired sufficient activity to enable us to screen mutant libraries for activity directly on an alkane (octane) using a high throughput NADPH consumption assay (see Example 1). NADPH oxidation alone, however, may not provide an accurate measure of catalytic activity since reducing equivalents from NADPH can be diverted into forming reduced oxygen intermediates (H₂O or H₂O₂). Therefore all subsequent generations were screened using a combination of the 8-pnpane assay, sensitive to product formation, and NADPH consumption in the presence of octane (see Experimental).

[0179] By monitoring cell growth under conditions where the P450 enzyme was expressed, we also determined that the enzyme had become toxic to the E. coli cells during the process of acquiring higher activity towards alkanes. This increased toxicity placed an artificial barrier on how active the enzyme could become and still appear as a positive during high throughput screening for alkane hydroxylase activity. By altering the growth and expression conditions to limit enzyme production during growth we were able to continue the evolution and identify P450 BM-3 mutants with very high alkane hydroxylation activities.

[0180] Five generations of mutagenesis and screening yielded P450 BM-3 mutant IX139-3. As shown in FIG. 14, the enzyme evolved using 8-pnpane and octane to screen for more active clones was highly active on octane, hexane, cyclohexane and pentane. For example, IX139-3 is 38-fold more active on octane than the wildtype enzyme. The rates for hydroxylation of all the liquid alkanes exceed that of wildtype P450 BM-3 on its fatty acid substrates, lauric and palmitic acid (FIG. 14). The evolved enzyme was also 2-fold more active on palmitic acid (FIG. 14), which was surprising since this fatty acid is presumably one of the natural, and optimal, substrates for P450 BM-3. Analysis of the products of reaction with n-alkanes showed hydroxylation at subterminal positions (Table 8), similar to wildtype enzyme's regioselectivity on fatty acids. No further oxidation to diols or ketones was observed. Cyclohexanol was the sole product of hydroxylation of cyclohexane. For the oxidation of octane, hexane, and cyclohexane, the ratio of products formed to dioxygen consumed was 1:1, as determined by GC/MS, and H₂O₂ was not detected. This demonstrated that reducing equivalents derived from NADPH result in substrate hydroxylation and the mutant does not waste electrons to produce reduced oxygen intermediates. H₂O₂ is not detected for the oxidation of the remaining substrates, and it is assumed that substrate hydroxylation is fully coupled to NADPH oxidation.

[0181] P450 BM-3 IX139-3 is a better catalyst than known, naturally-occurring alkane monooxygenases acting on their most preferred substrates (FIG. 15). For example, the preferred substrate for the non-heme iron alkane monooxygenase (AlkB) from Pseudomonas oleovorans is octane; its reported maximal turnover rate is 210 min-1 (Shanklin et al., 1997). In contrast to P450 BM-3, AlkB is membrane-bound and requires two additional proteins, NADPH-rubredoxin reductase and rubredoxin, for catalytic activity. Soluble (sMMO) and particulate methane monooxygenase (PMMO) are large (about 300 kDa), multimeric iron (and in the case of pMMO, iron-copper) enzymes with turnover rates of 200-250 min⁻¹ for gaseous alkanes (methane, 222 min⁻¹) (Fox et al., 1990; Fox et al., 1989; Tonge et al., 1977). Rates on alkanes larger than C4 are much lower (Green and Dalton, 1989). Furthermore, AlkB and MMO have not been produced with high activity in heterologous hosts suitable for protein engineering (Shanklin 1997; Murrell et al., 2000; Staijen et al., 2000). Among the best known of the cytochrome P450 alkane hydroxylases is P450 Cm1 (CYP52 A3), which is involved in alkane metabolization in Candida maltosa (Zimmer et al., 1996). This enzyme is also membrane-bound, requires a separate reductase, and has turnover rates lower than AlkB (27 min⁻¹ for the purified protein and 40 min⁻¹ for microsomal preparations) (Scheller et al., 1996). A number of other P450s also show low levels of activity for alkanes (Stevenson et al., 1996; Fisher et al., 1998; Munro et al., 1993).

[0182] From the intriguing possibility that the P450 BM-3 could be engineered to accept the small, gaseous hydrocarbon substrates preferred by MMO, the ability of the IX139-3 mutant to hydroxylate butane and propane was determined. Since the screen identified mutants more active on 8-pnpane and longer-chain alkanes, high activities on propane and butane were not necessarily expected. Based upon NADPH consumption, IX139-3 oxidized butane and propane with initial rates of 1800 and 860 min⁻¹, respectively (FIG. 14), which compared favorably to those of the much larger MMO. Comparing with the wildtype, IX139-3 hydroxylates butane and propane 108 and 57 times faster, respectively. The sole products of propane and butane oxidation by P450 BM-3 IX139-3 were 2-propanol and 2-butanol (Table 8).

[0183] The P450 resting state contains an iron protoporphyin IX as a low-spin six-coordinate ferric species with a dissociable water ligated trans to the proximal cysteinate (Ortiz de Montellano, 1995). Substrate binding displaces water and generates a high-spin five-coordinate species. The spin state shift causes the redox potential of the heme to increase, which activates the protein for hydroxylation. The heme's absorption maximum at 419 nm corresponds to the low-spin species; a shift to 390 nm is characteristic of the high-spin form. This spectral shift is induced in IX139-3 by all the substrates (FIG. 16A), which allowed estimations of K_(d)'s for octane, hexane, and laurate of 10 μM, 27 μM, and 19 μM, respectively. Only the fatty acid substrates produce a spin shift in the wildtype enzyme (laurate: K_(d)=260 μM).

[0184] Crystal structures of wildtype P450 BM-3 with and without substrate reveal large conformational changes upon substrate binding at the active site (Haines, D. C., Tomchick, D. R., Machius, M. & Peterson, J. A. Pivotal role of water in the mechanism of P450 BM-3. Biochemistry 40, 13456-13465 (2001); Li, H. Y. & Poulos, T. L. The structure of the cytochrome p45OBM-3 haem domain complexed with the fatty acid substrate, palmitoleic acid. Nat. Struct. Biol. 4, 140-146 (1997); Paulsen, M. D. & Ornstein, R. L. Dramatic Differences in the Motions of the Mouth of Open and Closed Cytochrome P450bm-3 by Molecular-Dynamics Simulations. Proteins 21, 237-243 (1995); Chang, Y. T. & Loew, G. Homology modeling, molecular dynamics simulations, and analysis of CYP119, a P450 enzyme from extreme acidothermophilic archaeon Sulfolobus solfataricus. Biochemistry 39, 2484-2498 (2000)).The substrate free structure displays an open access channel with 17 to 21 ordered water molecules. Substrate recognition serves as a conformational trigger to close the channel, which dehydrates the active site, increases the redox potential, and allows dioxygen to bind to the heme. Five of the 11 amino acid substitutions in IX139-3 occur in the region which undergoes the largest conformational change, the F and G helices and the loop connecting them, as well as the I helix across which the F and G helices must slide (FIG. 4). The F and G helices serve as a lid which closes over the substrate access channel upon substrate binding.

[0185] Attempts to engineer catalyst specificity are often limited to altering amino acids directly involved in substrate recognition and binding. Only one residue that is in direct contact with substrate in the wildtype enzyme has been mutated in IX139-3 (V78A). Amino acids R47, Y51, F42, and F87 have been proposed to be essential for fatty acid substrate recognition (Noble et al., 1999). R47, Y51, and F42 are located at the mouth of the substrate-binding pocket. R47 and Y51 interact with the substrate carboxylate moiety through electrostatic and hydrogen bonding interactions, while F42 serves as a hatch covering the binding pocket. None of these important residues has been mutated in IX139-3. A salt bridge between R255 and D217 in the substrate free structure can be disrupted by the R255S mutation in IX139-3. This mutation may facilitate conformational changes that permit alkanes to bind more favorably. Rational engineering of the substrate binding pocket of P450 BM-3 produced a triple mutant (F87V, L188Q, A74G) with increased activity for octane (Appel et al., 2001). Directed evolution to produce mutant IX139-3 did not find any beneficial mutations at these sites.

[0186] The fact that a few amino acid substitutions can produce a significant increase in P450 BM-3's activity on fatty acids, the presumed biological substrates, indicates that natural selection does not place an advantage on maximizing activity, possibly because such a broadly active enzyme is also toxic to the host organism, as it is to E. coli. By evolving the enzyme uncoupled from its biological context we are able to uncover the true catalytic potential of the cytochrome P450. P450 BM-3 mutant IX139-3 is the fastest alkane hydroxylase known. Easily over-expressed in E. coli, soluble and requiring no additional electron transfer proteins for catalysis, this enzyme should prove an attractive catalyst for selective hydrocarbon oxidation.

Example 4 Alkene Oxidation for P450 BM-3 Mutant IX139-3

[0187] Epoxidation of alkenes is an important reaction in organic synthesis since they are important chemical building blocks. The oxirane ring is subject to ring opening by various nucleophiles (oxygen, sulfur, nitrogen, carbon), which yield bifunctional compounds (Carey and Sundberg, 1990). In the chemical industry, epoxides are used in the production of polymers (polyether polyols), as well as glycols, polyglycols, and alkanolamines. Optically pure epoxides are useful intermediates in the synthesis of pharmaceuticals, agrochemicals, perfumes, and liquid crystals where chirality plays a critical role in function. Great efforts in developing chemical catalyst for alkene epoxidation has resulted in limited successes (Jacobson, 1993; White et al., 2001). The main limitations of chemical methods for alkene epoxidation is poor catalytic efficiencies for trans and terminal olefins (Faber, 2000). Furthermore, chemical methods produce large amounts of toxic byproducts. For example, the current industrial processes for the synthesis of propylene oxide from propene use large amounts of Cl₂ that lead to equipment corrosion and toxic byproducts. Monooxygenases provide an alternative to chemical means for epoxidation (Schmid et al., 2001).

[0188] The mutant IX139-3 was shown to have broad substrate specificity for alkanes with varying chain length (C8-C3). Furthermore, the variant was also shown to be more active on fatty acids. The activity for alkenes was also investigated, and the mutant is also more efficient in alkene oxidation. IX139-3 may prove to be a general catalyst for hydrocarbon oxidation, and it may find uses in the fine chemical industry as well as in bioremediation.

Materials and Methods

[0189] All chemical reagents were procured for Aldrich, Sigma, or Fluka.

Expression of P450 BM-3 Mutant

[0190] See example 1. A. Expression and purification of p450 BM-3 variants

Determination of the Maximum Initial Rate for Alkene Oxidation

[0191] The enzyme was purified and quantified as described above. A typical reaction solution contained enzyme (1.0 ml, 1 μM), alkene (10 μL, 1.0 mM), and methanol (1%) in potassium phosphate buffer (0.1 M, pH 8.0). The solution was incubated at room temperature, and the reaction was initiated by the addition of NADPH (200 μL, 200 μM). The rate of NADPH oxidation was monitored at 340 nm.

Substrate Conversion and Product Characterization

[0192] A typical reaction contained purified enzyme (1.0 ml, 1 μM), alkene (10 82 L, 1.0 mM), and methanol (1%) in phosphate buffer (0.1 M, pH 8.0). The solution was incubated at room temperature for 5 minutes, and the reaction was initiated by the addition of NADPH (200 μL, 200 μM). The solution was allowed to stir aerobically at room temperature for 30 minutes. For propene oxidation, the reaction contained enzyme (3.0 ml, 1.0 μM) in potassium phosphate buffer (0.1 M, pH 8.0), and the resulting solution was sealed in a 20 ml vial with a septum. The head-space was filled with propene and the reaction was initiated by the addition of NADPH (100 μl, 0.5 mM). The reaction was stirred at room temperature for 1.5 hours.

[0193] The products were analyzed by gas chromatography/mass spectrometry using an Hewlett Packard HP 6890 series gas chromatograph coupled with an Hewlett Packard 5973 mass selective detector). The GC was fitted with a HP FFAP column (crosslinked FFAP, 30 m×0.25 mm×0.25 μm film thickness). The condition for propene is as follows: (1) 35° C. for 1.7 minutes. (2) 35 to 200° C. at 20° C./min. (3) 200° C. for 1 minute. The condition for styrene, cyclohexene, and 1-hexene is as follows: (1) 100° C. for 4.5 minutes. (2) 100 to 200° C. at 20° C./min. (3) 200 for 7.0 minutes. Authentic standards were used to identify the products by retention time. Products were further verified by matching the fragmentation distributions with a database in the software provided by the instrument manufacturer.

4-(4-Nitrobenzyl)pyridine Assay

[0194] A typical reaction contained enzyme (1.0 ml, 1×10⁻⁶ M) in potassium phosphate buffer (0.1 M, pH 8.0), alkene (1.0×10⁻³ M), and methanol (1%) in a vial. The reaction was initiated with NADPH (50 μl, 1.0×10⁻³ M), and allowed to stir aerobically for 5 minutes. 300 μl of a stock solution of 4-(4-nitrobenzyl)pyridine (5% w/w in acetone) was added to the reaction, and the vial was sealed. The reaction was incubated at 80° C. for 20 minutes and chilled on ice. 600 μl of an ethylacetate/acetone (5:2) solution and 300 μl 5 M aqueous NaOH was added to the reaction solution. The solution was mixed thoroughly, and the absorbance of the organic layer was measured at 540 nm in a glass 96-well plate with microplate spectrophotometer.

Substrate Binding

[0195] See Example 3: Substrate binding.

Results and Discussion

[0196] P450 BM-3 is known to form epoxides from various substrates that vary in size and structure (Martinez and Stewart, 2000; Fruetel et al., 1994; Capdevila et al., 1996; Ruettinger and Fulco, 1981; Schneider et al., 1999). For example, P450 BM-3 oxidizes arachidonic acid to 18(R)-hydroxyeicosatetraenoic acid and 14(S),15(R)-epoxyeicosatrienoic acid (80 and 20% of total products, respectively), and eicosapentaenoic acid to 17(S),18(R)-epoxyeicosatetraenoic acid (99% total products) (Capdevila et al., 1996). Furthermore, stryene is oxidized solely to styrene oxide (Fruetel et al., 1994).

[0197] The evolved enzyme IX139-3 has been shown to be highly active for alkane and fatty acid oxidation (see above). Interestingly, the initial rates of NADPH consumption in the presence of alkenes were also very high (FIG. 17). For example, the rate of cyclohexene and styrene is 1200 and 300 fold more active than wildtype, respectively. For all the alkenes investigated, H₂O₂ is not detected and the enzyme is assumed to be fully coupled. However, there is a possibility that H₂O can be produced, which can only be accurately determined by quantitating product formation.

[0198] The products for styrene, propene, cyclohexene, and 1-hexene were characterized by gas chromatography/mass spectrometry. The sole product for stryene was styrene oxide, which is similar to what is found for the wildtype (Martinez et al., 2000). The major product from propene oxidation is propene oxide and allyl alcohol is only a minor species. Propene oxidation has not been previously reported by P450 BM-3 (See FIG. 18).

[0199] The products for styrene, propene, cyclohexene, and 1-hexene were characterized by gas chromatography/mass spectrometry. The sole product for stryene was styrene oxide, which is similar to what is found for the wildtype (Fruetel et al., 1994). The products from propene oxidation is propene oxide and allyl alcohol. Propene oxidation has not been previously reported byP450 BM-3.

[0200] GC/MS analysis of cyclohexene oxidation by IX139-3 revealed two products, cyclohexene oxide and 1,2-cyclohexane diol. The sole product of the bioconversion was most likely cyclohexene oxide, and 1,2-cyclohexane diol probably occured via base-catalyzed hydrolysis of cyclohexene oxide (see reaction conditions) (March, 1992). Cyclohexene oxide was converted to 1,2-cyclohexane diol in potassium phosphate buffer (0.1 M, pH 8.0), as verified using GC/MS.

[0201] No epoxidation of 1-hexene was detected, and the product was 1-hexene-3-ol, which results from hydroxylation at the allylic position. This was expected since wildtype P450 BM-3 preferentially oxidizes fatty acids with a terminal double bond at the ω-2 position and the corresponding terminal epoxide is not formed (Shirane et al., 1993). The selectivity may be due to the C—H bond strength. The ω-2 secondary allylic (DH°298˜83 kcal/mol) C—H are weaker than the ω-3 secondary (DH°298˜98 kcal/mol) C—H bond. However, a terminal double bond should also be oxidized unless there is a mechanistic or steric factor. P450 BM-3 is known to form the corresponding epoxide from cis-9-hexadecenoic acid as well as hydroxylated products (Ruettinger and Fulco, 1981), which demonstrates that there is no mechanistic restraint. Hence, a steric restraint exist that hinders terminal epoxidation. This is further supported by the fact terminal oxidation of fatty acids and alkanes are not observed.

[0202] The 4-(nitrobenzyl)pyridine (4-NBP) (Kim and Thomas, 1992) assay was used to determine epoxide formation for allyl chloride, isoprene, 2-hexene and 3-hexene since the corresponding epoxides were not available as standards. Nucleophilic attack of the oxirane ring by 4-NBP results alkylation, and the product results in a purple colored (AbS_(max)˜550 nm) species. IX139-3 in the presence of isoprene, dioxygen and NADPH forms the alkylated product (FIG. 19) with a λ_(max)˜550 nm. Similar results occur when allyl chloride is the substrate. The color develops only when enzyme, substrate, and NADPH are present. If any of the components are missing then no color is formed. Control experiments with IX139-3, styrene, and NADPH, which is shown to form styrene oxide as determined by GC/MS, also test positive using the 4-NBP assay. When either 2-hexene or 3-hexene are used as substrates, no alkylation product is formed, and it is assumed that the alkene is hydroxylated.

[0203] 139-3 in the presence of alkenes induces small spin state shifts. Whereas, the wildtype in presence does not produce a spin state shift. This indicates that the substrate binding properties have been altered. IX139-3 is very active for alkane hydroxylation, fatty acid oxidation and alkene epoxidation, and the substrate specificity is broad. This mutant may be useful a general catalyst for hydrocarbon oxidation. The mutant may be useful for fine chemical synthesis when only one product is detected such as styrene oxide. Alternatively, the mutant may find applications in bioremediation when more than one product is generated.

General Definitions

[0204] The following defined terms are used throughout the present specification, and should be helpful in understanding the scope and practice of the present invention.

[0205] “Cytochrome P450 monooxygenase” or “P450 enzyme” means an enzyme in the superfamily of P450 haem-thiolate proteins, which are widely distributed in bacteria, fungi, plants and animals. The enzymes are involved in metabolism of a plethora of both exogenous and endogenous compounds. Usually, they act as oxidases in multicomponent electron transfer chains, called here P450-containing monooxygenase systems. The unique feature which defines whether an enzyme is a cytochrome P450 enzyme is the reduced form of the enzyme which binds carbon monoxide and results in a characteristic absorption maximum at 450 nm.. Reactions catalyzed by cytochrome P450 enzymes include epoxidation, N-dealkylation, O-dealkylation, S-oxidation and hydroxylation. The most common reaction catalyzed by P450 enzymes is the monooxygenase reaction, i.e., insertion of one atom of oxygen into a substrate while the other oxygen atom is reduced to water. Although any P450 enzyme can be modified according to the invention, the following are non-limiting examples of preferred P450 enzymes: P450 BM-3 (GenBank Accession Nos. J04832 (SEQ ID NO: 1) and P14779 (SEQ ID NO: 2)); CYP 2C3 (GenBank P00182, SEQ ID NO: 3); CYP 2C9 (GenBank P11712; SEQ ID NO: 4), CYP 2D1v (GenBank P10633; SEQ ID NO: 5), and CYP 108 (GenBank P33006; SEQ ID NO: 6).

[0206] An “oxidation”, “oxidation reaction”, or “oxygenation reaction”, as used herein, is a chemical or biochemical reaction involving the addition of oxygen to a substrate, to form an oxygenated or oxidized substrate or product. An oxidation reaction is typically accompanied by a reduction reaction (hence the term “redox” reaction, for oxidation and reduction). A compound is “oxidized” when it receives oxygen or loses electrons. A compound is “reduced” when it loses oxygen or gains electrons. An oxidation reaction can also be called an “electron transfer reaction” and encompass the loss or gain of electrons (e.g., oxygen) or protons (e.g., hydrogen) from a substance. Non-limiting examples of oxidation reactions include hydroxylation (e.g., RH+O₂+2H⁺+2e⁻→ROH+H₂O), epoxidation (RCH═CHR′+2H⁺ +O ₂+2e⁻→RCHOCHR′+H₂O), and ketone formation (RCH₂R′→RCOR′).

[0207] A “co-solvent” or “indirect solvent” herein means a non-solvent that becomes an acceptable solvent or co-solvent when a small amount of active solvent is added. For example, water is a non-solvent for various hydrophobic substances, but the addition of a water-miscible solvent such as DMSO, tetrahydrofuran (THF), methanol, ethanol, propanol, dioxane, or dimethylformamide (DMF), or other solvents known in the art, increases the solubility of hydrophobic compounds in the mixture.

[0208] The “organic solvent resistance” of an enzyme means its ability to function, optionally function over time, in an organic solvent or in a co-solvent. One way to evaluate organic solvent resistance is to assess the ability of the enzyme to resist a loss of activity over time, in one or more co-solvent systems . A more “organic-solvent resistant” enzyme can be one that is more resistant to loss of structure (unfolding) or function (enzyme activity) when exposed to an organic solvent or co-solvent. Preferred systems for testing organic solvent resistance include water/DMSO and water/THF mixtures, for example, 10% (v/v) DMSO and 2% (v/v/) THF.

[0209] “Alkane-oxidation capability” herein means the capability of a cytochrome P450 enzyme to oxidize at least one alkane. The alkane-oxidation capability of an enzyme can be evaluated, for example, by mixing the enzyme with an alkane in the presence of any necessary co-factors, and evaluate whether the alkane is oxidized. In particular, the capability of a cytochrome P450 variant to oxidize an alkane can be related to the capability of the corresponding wild-type P450 to oxidize the same alkane, e.g., by comparing maximum turnover rate, total amount of product formed, or any other suitable measure of enzyme activity. Many examples of alternative assays are provided herein. Preferred alkanes for which alkane-oxidation capability can be evaluated include 8-pnpane, octane, hexane, cyclohexane, propane, ethane, and/or butane.

[0210] “Alkene-oxidation capability” herein means the capability of a cytochrome P450 enzyme to oxidize at least one alkene. The alkene-oxidation capability of an enzyme can be evaluated, for example, by mixing the enzyme with an alkene in the presence of any necessary co-factors, and evaluate whether the alkene is oxidized to form an epoxide or hydroxylated product. In particular, the capability of a cytochrome P450 variant to oxidize an alkene can be related to the capability of the corresponding wild-type P450 to oxidize the same alkene, e.g., by comparing maximum turnover rate, total amount of product formed, or any other suitable measure of enzyme activity. Many examples of alternative assays are provided herein. Preferred alkenes for which alkane-oxidation capability can be evaluated include octene, hexene, propene, ethene, and/or butene.

[0211] The term “about” or “approximately” means within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, “about” can mean a range of up to 20%, preferably up to 10%, more preferably up to 5%, and more preferably still up to 1% of a given value. Alternatively, particularly with respect to biological systems or processes, the term can mean within an order of magnitude, preferably within 5-fold, and more preferably within 2-fold, of a value.

Molecular Biology Definitions

[0212] In accordance with the present invention there may be employed conventional molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (herein “Sambrook et al., 1989”); DNA Cloning: A Practical Approach, Volumes I and II (D. N. Glover ed. 1985); Oligonucleotide Synthesis (M. J. Gait ed. 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. (1985)); Transcription And Translation (B. D. Hames & S. J. Higgins, eds. (1984)); Animal Cell Culture (R. I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); B. Perbal, A Practical Guide To-Molecular Cloning (1984); F. M. Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994).

[0213] A “protein” or “polypeptide”, which terms are used interchangeably herein, comprises one or more chains of chemical building blocks called amino acids that are linked together by chemical bonds called peptide bonds.

[0214] An “enzyme” means any substance, preferably composed wholly or largely of protein, that catalyzes or promotes, more or less specifically, one or more chemical or biochemical reactions. The term “enzyme” can also refer to a catalytic polynucleotide (e.g. RNA or DNA).

[0215] A “native” or “wild-type” protein, enzyme, polynucleotide, gene, or cell, means a protein, enzyme, polynucleotide, gene, or cell that occurs in nature.

[0216] A “parent” protein, enzyme, polynucleotide, gene, or cell, is any protein, enzyme, polynucleotide, gene, or cell, from which any other protein, enzyme, polynucleotide, gene, or cell, is derived or made, using any methods, tools or techniques, and whether or not the parent is itself native or mutant. A parent polynucleotide or gene encodes for a parent protein or enzyme.

[0217] A “mutant”, “variant” or “modified” protein, enzyme, polynucleotide, gene, or cell, means a protein, enzyme, polynucleotide, gene, or cell, that has been altered or derived, or is in some way different or changed, from a parent protein, enzyme, polynucleotide, gene, or cell. A mutant or modified protein or enzyme is usually, although not necessarily, expressed from a mutant polynucleotide or gene.

[0218] A “mutation” means any process or mechanism resulting in a mutant protein, enzyme, polynucleotide, gene, or cell. This includes any mutation in which a protein, enzyme, polynucleotide, or gene sequence is altered, and any detectable change in a cell arising from such a mutation. Typically, a mutation occurs in a polynucleotide or gene sequence, by point mutations, deletions, or insertions of single or multiple nucleotide residues. A mutation includes polynucleotide alterations arising within a protein-encoding region of a gene as well as alterations in regions outside of a protein-encoding sequence, such as, but not limited to, regulatory or promoter sequences. A mutation in a gene can be “silent”, i.e., not reflected in an amino acid alteration upon expression, leading to a “sequence-conservative” variant of the gene. This generally arises when one amino acid corresponds to more than one codon. Table 9 outlines which amino acids correspond to which codon(s). TABLE 9 Amino Acids, Corresponding Codons, and Functionality Amino Acid SLC DNA codons Side Chain Property Isoleucine I ATT, ATC, ATA Hydrophobic Leucine L CTT, CTC, CTA, CTG, TTA, TTG Hydrophobic Valine V GTT, GTC, GTA, GTG Hydrophobic Phenylalanine F TTT, TTC Aromatic side chain Methionine M ATG Sulphur group Cysteine C TGT, TGC Sulphur group Alanine A GCT, GCC, GCA, GCG Hydrophobic Glycine G GGT, GGC, GGA, GGG Hydrophobic Proline P CCT, CCC, CCA, CCG Secondary amine Threonine T ACT, ACC, ACA, ACG Aliphatic hydroxyl Serine S TCT, TCC, TCA, TCG, AGT, AGC Aliphatic hydroxyl Tyrosine T TAT, TAC Aromatic side chain Tryptophan W TGG Aromatic side chain Glutamine Q CAA, CAG Amide group Asparagine N AAT, AAC Amide group Histidine H CAT, CAC Basic side chain Glutamic acid E GAA, GAG Acidic side chain Aspartic acid D GAT, GAC Acidic side chain Lysine K AAA, AAG Basic side chain Arginine R CGT, CGC, CGA, CGG, AGA, AGG Basic side chain Stop codons Stop TAA, TAG, TGA

[0219] “Function-conservative variants” are proteins or enzymes in which a given amino acid residue has been changed without altering overall conformation and function of the protein or enzyme, including, but not limited to, replacement of an amino acid with one having similar properties, including polar or non-polar character, size, shape and charge (see Table 9).

[0220] Amino acids other than those indicated as conserved may differ in a protein or enzyme so that the percent protein or amino acid sequence similarity between any two proteins of similar function may vary and can be, for example, at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95%, as determined according to an alignment scheme. As referred to herein, “sequence similarity” means the extent to which nucleotide or protein sequences are related. The extent of similarity between two sequences can be based on percent sequence identity and/or conservation. “Sequence identity” herein means the extent to which two nucleotide or amino acid sequences are invariant. “Sequence alignment” means the process of lining up two or more sequences to achieve maximal levels of identity (and, in the case of amino acid sequences, conservation) for the purpose of assessing the degree of similarity. Numerous methods for aligning sequences and assessing similarity/identity are known in the art such as, for example, the Cluster Method, wherein similarity is based on the MEGALIGN algorithm, as well as BLASTN, BLASTP, and FASTA. When using all of these programs, the preferred settings are those that results in the highest sequence similarity.

[0221] The “activity” of an enzyme is a measure of its ability to catalyze a reaction, i.e., to “function”, and may be expressed as the rate at which the product of the reaction is produced. For example, enzyme activity can be represented as the amount of product produced per unit of time or per unit of enzyme (e.g., concentration or weight), or in terms of affinity or dissociation constants. Preferred activity units for expressing activity include the catalytic constant (k_(cat)=v_(max)/E; Vmax is maximal turnover rate; E is concentration of enzyme); the Michaelis-Menten constant (K_(m)); and k_(cat)/K_(m). Such units can be determined using well-established methods in the art of enzymes.

[0222] The “stability” or “resistance” of an enzyme means its ability to function, over time, in a particular environment or under particular conditions. One way to evaluate stability or resistance is to assess its ability to resist a loss of activity over time, under given conditions. Enzyme stability can also be evaluated in other ways, for example, by determining the relative degree to which the enzyme is in a folded or unfolded state. Thus, one enzyme has improved stability or resistance over another enzyme when it is more resistant than the other enzyme to a loss of activity under the same conditions, is more resistant to unfolding, or is more durable by any suitable measure. For example, a more “organic-solvent” resistant enzyme is one that is more resistant to loss of structure (unfolding) or function (enzyme activity) when exposed to an organic solvent or co-solvent.

[0223] The term “substrate” means any substance or compound that is converted or meant to be converted into another compound by the action of an enzyme catalyst. The term includes aromatic and aliphatic compounds, and includes not only a single compound, but also combinations of compounds, such as solutions, mixtures and other materials which contain at least one substrate. Preferred substrates for hydroxylation using the cytochrome P450 enzymes of the invention include alkanes such as propane, ethane, butane, pentane, hexane, cyclohexane, and octane, and alkane derivatives such as alkanes substituted with one or more chemical group such as nitro-, sulphur-, halogen- and oxygen-containing groups, as well as aromatic groups, e.g., p-nitrophenoxyoctane (8-pnpane). Preferred substrates for epoxidation include alkenes such as propene, ethene, butene, pentene, hexene, cyclohexene, octene, as well as alkene derivatives, which are alkanes substituted with or linked to chemical substituents such as nitro-, sulphur-, halogen- and oxygen-containing groups, and/or aromatic groups.

[0224] The term “cofactor” means any non-protein substance that is necessary or beneficial to the activity of an enzyme. A “coenzyme” means a cofactor that interacts directly with and serves to promote a reaction catalyzed by an enzyme. Many coenzymes serve as carriers. For example, NAD+ and NADP+ carry hydrogen atoms from one enzyme to another. An “ancillary protein” means any protein substance that is necessary or beneficial to the activity of an enzyme.

[0225] The terms “oxygen donor”, “oxidizing agent” and “oxidant” mean a substance, molecule or compound which donates oxygen to a substrate in an oxidation reaction. Typically, the oxygen donor is reduced (accepts electrons). Exemplary oxygen donors, which are not limiting, include molecular oxygen or dioxygen (O₂) and peroxides, including alkyl peroxides such as t-butyl peroxide, and most preferably hydrogen peroxide (H₂O₂). A peroxide is any compound having two oxygen atoms bound to each other by a single bond, i.e., dioxygen (O₂) has a double bond between the oxygen atoms.

[0226] An “oxidation enzyme” is an enzyme that catalyzes one or more oxidation reactions, typically by adding, inserting, contributing or transferring oxygen from a source or donor to a substrate. Such enzymes are also called oxidoreductases or redox enzymes, and encompasses oxygenases, hydrogenases or reductases, oxidases and peroxidases. An “oxidase” is an oxidation enzyme that catalyzes a reaction in which molecular oxygen (dioxygen or O₂) is reduced, for example by donating electrons to (or receiving protons from) hydrogen.

[0227] A “luminescent” substance means any substance which produces detectable electromagnetic radiation, or a change in electromagnetic radiation, most notably visible light, by any mechanism, including color change, UV absorbance, fluorescence and phosphorescence. Preferably, a luminescent substance according to the invention produces a detectable color, fluorescence or UV absorbance. The term “chemiluminescent agent” means any substance which enhances the detectability of a luminescent (e.g., fluorescent) signal, for example by increasing the strength or lifetime of the signal.

[0228] A “polynucleotide” or “nucleotide sequence” is a series of nucleotide bases (also called “nucleotides”) in DNA and RNA, and means any chain of two or more nucleotides. A nucleotide sequence typically carries genetic information, including the information used by cellular machinery to make proteins and enzymes. These terms include double or single stranded genomic and cDNA, RNA, any synthetic and genetically manipulated polynucleotide, and both sense and anti-sense polynucleotide (although only sense stands are being represented herein). This includes single- and double-stranded molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA hybrids, as well as “protein nucleic acids” (PNA) formed by conjugating bases to an amino acid backbone. This also includes nucleic acids containing modified bases, for example thio-uracil, thio-guanine and fluoro-uracil.

[0229] The polynucleotides herein may be flanked by natural regulatory sequences, or may be associated with heterologous sequences, including promoters, enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5′- and 3′-non-coding regions, and the like. The nucleic acids may also be modified by many means known in the art. Non-limiting examples of such modifications include methylation, “caps”, substitution of one or more of the naturally occurring nucleotides with an analog, and internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.).

[0230] A “coding sequence” or a sequence “encoding” a polypeptide, protein or enzyme is a nucleotide sequence that, when expressed, results in the production of that polypeptide, protein or enzyme, i.e., the nucleotide sequence encodes an amino acid sequence for that polypeptide, protein or enzyme. A coding sequence is “under the control” of transcriptional and translational control sequences in a cell when RNA polymerase transcribes the coding sequence into mRNA, which is then trans-RNA spliced and translated into the protein encoded by the coding sequence. Preferably, the coding sequence is a double-stranded DNA sequence which is transcribed and translated into a polypeptide in a cell in vitro or in vivo when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are determined by a start codon at the 5′ (amino) terminus and a translation stop codon at the 3′ (carboxyl) terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. If the coding sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and transcription termination sequence will usually be located 3′ to the coding sequence.

[0231] The term “gene”, also called a “structural gene” means a DNA sequence that codes for or corresponds to a particular sequence of amino acids which comprise all or part of one or more proteins or enzymes, and may or may not include regulatory DNA sequences, such as promoter sequences, which determine for example the conditions under which the gene is expressed. Some genes, which are not structural genes, may be transcribed from DNA to RNA, but are not translated into an amino acid sequence. Other genes may function as regulators of structural genes or as regulators of DNA transcription. A gene encoding a protein of the invention for use in an expression system, whether genomic DNA or cDNA, can be isolated from any source, particularly from a human cDNA or genomic library. Methods for obtaining genes are well known in the art, e.g., Sambrook et al (supra).

[0232] A “promoter sequence” is a DNA regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3′ direction) coding sequence. For purposes of defining this invention, the promoter sequence is bounded at its 3′ terminus by the transcription initiation site and extends upstream (5′ direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background.

[0233] Polynucleotides are “hybridizable” to each other when at least one strand of one polynucleotide can anneal to another polynucleotide under defined stringency conditions. Stringency of hybridization is determined, e.g., by (a) the temperature at which hybridization and/or washing is performed, and (b) the ionic strength and polarity (e.g., formamide) of the hybridization and washing solutions, as well as other parameters. Hybridization requires that the two polynucleotides contain substantially complementary sequences; depending on the stringency of hybridization, however, mismatches may be tolerated. Typically, hybridization of two sequences at high stringency (such as, for example, in an aqueous solution of 0.5×SSC at 65° C.) requires that the sequences exhibit some high degree of complementarity over their entire sequence. Conditions of intermediate stringency (such as, for example, an aqueous solution of 2×SSC at 65° C.) and low stringency (such as, for example, an aqueous solution of 2×SSC at 55° C.), require correspondingly less overall complementarity between the hybridizing sequences. (1×SSC is 0.15 M NaCl, 0.015 M Na citrate.) Polynucleotides that hybridize include those which anneal under suitable stringency conditions and which encode polypeptides or enzymes having the same function, such as the ability to catalyze an oxidation, oxygenase, or coupling reaction of the invention.

[0234] The term “expression system” means a host cell and compatible vector under suitable conditions, e.g. for the expression of a protein coded for by foreign DNA carried by the vector and introduced to the host cell. Common expression systems include bacteria (e.g., E. coli and B. subtilis) or yeast (e.g., S. cerevisiae) host cells and plasmid vectors, and insect host cells and Baculovirus vectors. As used herein, a “facile expression system” means any expression system that is foreign or heterologous to a selected polynucleotide or polypeptide, and which employs host cells that can be grown or maintained more advantageously than cells that are native or heterologous to the selected polynucleotide or polypeptide, or which can produce the polypeptide more efficiently or in higher yield. For example, the use of robust prokaryotic cells to express a protein of eukaryotic origin would be a facile expression system. Preferred facile expression systems include E. coli, B. subtilis and S. cerevisiae host cells and any suitable vector.

[0235] The term “transformation” means the introduction of a foreign (i.e., extrinsic or extracellular) gene, DNA or RNA sequence to a host cell, so that the host cell will express the introduced gene or sequence to produce a desired substance, typically a protein or enzyme coded by the introduced gene or sequence. The introduced gene or sequence may include regulatory or control sequences, such as start, stop, promoter, signal, secretion, or other sequences used by the genetic machinery of the cell. A host cell that receives and expresses introduced DNA or RNA has been “transformed” and is a “transformant” or a “clone.” The DNA or RNA introduced to a host cell can come from any source, including cells of the same genus or species as the host cell, or cells of a different genus or species.

[0236] The terms “vector”, “vector construct” and “expression vector” mean the vehicle by which a DNA or RNA sequence (e.g. a foreign gene) can be introduced into a host cell, so as to transform the host and promote expression (e.g. transcription and translation) of the introduced sequence. Vectors typically comprise the DNA of a transmissible agent, into which foreign DNA encoding a protein is inserted by restriction enzyme technology. A common type of vector is a “plasmid”, which generally is a self-contained molecule of double-stranded DNA, that can readily accept additional (foreign) DNA and which can readily introduced into a suitable host cell. A large number of vectors, including plasmid and fungal vectors, have been described for replication and/or expression in a variety of eukaryotic and prokaryotic hosts. Non-limiting examples include pKK plasmids (Clonetech), pUC plasmids, pET plasmids (Novagen, Inc., Madison, Wis.), pRSET or pREP plasmids (Invitrogen, San Diego, Calif.), or pMAL plasmids (New England Biolabs, Beverly, Mass.), and many appropriate host cells, using methods disclosed or cited herein or otherwise known to those skilled in the relevant art. Recombinant cloning vectors will often include one or more replication systems for cloning or expression, one or more markers for selection in the host, e.g., antibiotic resistance, and one or more expression cassettes. Preferred vectors are described in the Examples, and include without limitations pcWori, pET-26b(+), pXTD14, pYEX-S1, pMAL, and pET22-b(+). Other vectors may be employed as desired by one skilled in the art. Routine experimentation in biotechnology can be used to determine which vectors are best suited for used with the invention, if different than as described in the Examples. In general, the choice of vector depends on the size of the polynucleotide sequence and the host cell to be employed in the methods of this invention.

[0237] The terms “express” and “expression” mean allowing or causing the information in a gene or DNA sequence to become manifest, for example producing a protein by activating the cellular functions involved in transcription and translation of a corresponding gene or DNA sequence. A DNA sequence is expressed in or by a cell to form an “expression product” such as a protein. The expression product itself, e.g. the resulting protein, may also be said to be “expressed” by the cell. A polynucleotide or polypeptide is expressed recombinantly, for example, when it is expressed or produced in a foreign host cell under the control of a foreign or native promoter, or in a native host cell under the control of a foreign promoter.

[0238] A polynucleotide or polypeptide is “over-expressed” when it is expressed or produced in an amount or yield that is substantially higher than a given base-line yield, e.g. a yield that occurs in nature. For example, a polypeptide is over-expressed when the yield is substantially greater than the normal, average or base-line yield of the native polypolypeptide in native host cells under given conditions, for example conditions suitable to the life cycle of the native host cells.

[0239] “Isolation” or “purification” of a polypeptide or enzyme refers to the derivation of the polypeptide by removing it from its original environment (for example, from its natural environment if it is naturally occurring, or form the host cell if it is produced by recombinant DNA methods). Methods for polypeptide purification are well-known in the art, including, without limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and countercurrent distribution. For some purposes, it is preferable to produce the polypeptide in a recombinant system in which the protein contains an additional sequence tag that facilitates purification, such as, but not limited to, a polyhistidine sequence. The polypeptide can then be purified from a crude lysate of the host cell by chromatography on an appropriate solid-phase matrix. Alternatively, antibodies produced against the protein or against peptides derived therefrom can be used as purification reagents. Other purification methods are possible. A purified polynucleotide or polypeptide may contain less than about 50%, preferably less than about 75%, and most preferably less than about 90%, of the cellular components with which it was originally associated. A “substantially pure” enzyme indicates the highest degree of purity which can be achieved using conventional purification techniques known in the art.

BIBLIOGRAPHY

[0240] Adam W, et al. Eur J Org Chem 2000;16:2923-2926.

[0241] Appel D, et al. J Biotechnol 2001;88:167-171.

[0242] Appel D, et al. J Bacteriol 2001;88,:167-171.

[0243] Arnold F H. Acc Chem Res 1998;3:125-131.

[0244] Ashraf W, et al. FEMS Microbiol Lett 1994;122:1-6.

[0245] Barnes H J. In: Methods in Enzymology, Volume 272, (Eds.: Johnson E F and Waterman M R) Academic Press, San Diego, 1996, pp 3-17.

[0246] Benson S, et al. J Bacteriol 1979;140:754-762.

[0247] Beratan, DNT. Protein Electron Transfer, 1996, Oxford: Bios Scientific Publishers.

[0248] Boddupalli S S, et al. J Biol Chem 1990;265:4233-4239.

[0249] Boddupalli S S, et al. Arch Biochem Biophys 1992;292:20-28.

[0250] Capdevila J H, et al. J Biol Chem 1996;271:22663-22671.

[0251] Carmichael A B, and Wong, L L. Eur J Biochem 2001;268:3117-3125.

[0252] Chang Y T and Loew G. Biochemistry 2000;39:2484-2498.

[0253] Chen H Y, et al. Science 2000;287:1995-1997. de Montellano (Ed.), “Cytochrome P450; Structure, Mechanism, and Biochemistry, 2nd Ed., Plenum Press, New York (1995).

[0254] Dordick J S. Biotech Progr 1992;8:259-267.

[0255] Farinas E, et al. Adv Syn Catal 2001;343:601-606.

[0256] Fisher M B, et al. Biochem Biophys Res Commun 1998;248:352-355.

[0257] Fox B G, et al. Methods Enzymol 1990;188:191-202.

[0258] Fruetel J A, et al. J Biol Chem 1994;269:28815-28821.

[0259] Graham-Lorence S E, et al. J Biol Chem 1997:272:1127 - 1135.

[0260] Green J and Dalton H. J Biol Chem 1989;264:17698-17703.

[0261] Groves J T and Han Y -H. In: Cytochrome P450: Structure, Mechanism, and Biochemistry (Ed.: Ortiz de Montellano, P. R. ), Plenum Press, New York, N.Y., 1995, pp. 3-48.

[0262] Haines D C, et al. Biochemistry 2001;40:13456-13465.

[0263] Hartmann M, and Ernst S. Angew Chem Int Ed 2000;39:888-890.

[0264] Joo H, et al. Nature 1999;399:670-673.

[0265] Kim and Thomas, Bull Environ Contam Toxicol 1992;49:879-885.

[0266] Klibanov A M. Proc Natl Acad Sci 1997;92:10969-10976.

[0267] Kuhn-Velten, W N. Z. Naturforsch 1997;52:132-135.

[0268] Kvittingen L, et al. Tetrahedron 1992;48:2793-2802.

[0269] Leadbetter E R, and Foster J W. Nature 1959;184:1428-1429.

[0270] Lewis D F V. Cytochromes P450: Structure, Function and Mechanism. 1996, London: Taylor & Francis.

[0271] Li Q, et al. Biochim Biophys Acta 2001;1545:114-121.

[0272] Li H, and Poulos T L. Nature Struct Biol 1997;4:140-146.

[0273] March, J. In: Advanced Organic Chemistry: Reactions, Mechanisms, and Structure 4th edition, John Wiley and Sons, New York, 1992b, pp. 1072-1074.

[0274] March, J. In: Advanced Organic Chemistry: Reactions, Mechanisms, and Structure 4th edition, John Wiley and Sons, New York, 1992a, pp. 882-884.

[0275] Martinez and Stewart, Curr Org Chem 2000;4:263-282.

[0276] Miles C S, et al. Biochim Biophys Acta 2000;1543:383-407.

[0277] Matson R S, et al. Biochem Biophys Acta 1977;487:487-494.

[0278] Miura Y and Fulco A J. Biochim Biophys Acta 1975;388,:305-317.

[0279] Modi S, et al. Biochemistry 1995;34:8982-8988.

[0280] Moser C C. et al. J Bioenerg Biomembr 1995;27:263-274.

[0281] Munro A W, et al. Biochem Soc Trans 1993;21:412S.

[0282] Munro A W, et al. Eur J Biochem 1996;239:403-409.

[0283] Murrell J C, et al. Arch. Microbiol 2000;173:325-332.

[0284] Narhi L O, and Fulco, A J. J Biol Chem 1986;261:7160-7169.

[0285] Narhi L O, and Fulco, A J. J Biol Chem 1987;262:6683-6690.

[0286] Noble M A, et al. Biochem J 1999;339:371-379.

[0287] Oliver C F, et al. Biochem J 1997;327:537-544.

[0288] Omura T, and Sato R J. J Biol Chem 1964;239:2370-2378.

[0289] Ortiz de Montellano P R. Cytochrome P450: structure, mechanism, and biochemistry (ed. Ortiz de Montellano, PR) (Plenum Press, New York, 1995).

[0290] Ost T W, et al. FEBS Lett 2000;486:173-177.

[0291] Paulsen M D and Ornstein R L. Proteins 1995;21:237-243.

[0292] Peterson J A and Graham-Lorence S E, “Bacterial P450s: Structural Similarities and Functional Differences”. In: Cytochrome P450: Structure, Mechanism, and Biochemistry. 2nd Ed., edited by Ortiz de Montellano, PR. Plenum Press, New York, 1995.

[0293] Ruettinger R T and Fulco A J. J Biol Chem 1981;256:5728-5734.

[0294] Ruettinger R T, et al. J Biol Chem 1989;264:10987-10995.

[0295] Scheller U, et al. Arch Biochem Biophys 1996;328:245-254.

[0296] Schmid et al., Nature 2001;409:258-268.

[0297] Schmid A, et al. J Biol Chem 1989;264:10023-10033.

[0298] Schneider S, et al. Biotechnol Bioeng 1999;64:333-341.

[0299] Schwaneberg U, et al. J Biomol Screening 2001;6:111-117.

[0300] Schwaneberg U, et al. Anal Biochem 1999a;269:359-366.

[0301] Schwaneberg U, et al. J Chromatogr A 1999b;848:149-159.

[0302] Shanklin J, et al. Proc Natl Acad Sci 1997;94:2981-2986.

[0303] Shilov A E and Shul'pin G B. Chem. Rev., 1997, 97, 2879-2932.

[0304] Staijen I E, et al. Eur J Biochem 2000;267:1957-1965.

[0305] Stevenson J A, et al. J Am Chem Soc 1996; 118:12846-12847.

[0306] Thomas J M, et al. Ace Chem Res 2001;34:191-200.

[0307] Tonge G M, et al. Biochem J 1977;1161:333-344.

[0308] Watkinson R J, and Morgan P. Biodegradation 1990;1:79-92.

[0309] Yeom, H. & Sligar, S. G. Oxygen activation by cytochrome P45OBM-3: effects of mutating an active site acidic residue. Arch Biochem Biophys 337, 209-216. (1997)

[0310] Zhao H M, et al. Nature Biotechnol 1998;16:258-261.

[0311] Zhao H, et al. In: Manual of Industrial Microbiology and Biotechnology 2nd Edition (Eds.: Demain and Davies), ASM Press, Washington D. C., 1999, pp. 597-604.

[0312] Zimmer T, et al. Biochem Biophys Res Commun 1996;224:784-789.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 42 <210> SEQ ID NO 1 <211> LENGTH: 4957 <212> TYPE: DNA <213> ORGANISM: Bacillus megaterium <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: GenBank / J04832 <309> DATABASE ENTRY DATE: 1996-02-13 <313> RELEVANT RESIDUES: (1)..(4957) <400> SEQUENCE: 1 agatctttat gaagacatag ctgcagaaga aaaagcaaga gctacatatc aatggttaat 60 tgatatatca gatgatcccg atttaaacga cagcttacga tttttacgag aaagagagat 120 tgttcactca cagcggttcc gcgaggccgt ggagatttta aaagatgaca gagacaggaa 180 gaaaatcttt taactagtaa aaaaacatcc cccttggcga atgcaaacga aaggagggat 240 gttttttgtt gtgactgcgt tgattatgcg ctagaactgc agtgacaaga aacaaccttt 300 aatttccctt caacatcttt ccaaactcgc gtataactgt attcacctcc aatagattca 360 ccggttgcca gtgccccatt taacgctact tttgtaacgg taacggcaag ttcttgaaac 420 agtttaactt cttgttccaa cacttccatg cccgctatat caagactttt tgaacgatga 480 acatttatat cttcttcttt tgacaaccat tgcccaaggt gattcacaaa aataagctca 540 tctgaaagta attcttctaa tagctctatg ttattagaaa gcatggctga gcgaagcatt 600 tcttcgtatt ctataactct tgcttgattc atttttaatc ctcctttacg ccttgtgtaa 660 ctcttttcta tttccacgtt gcttttcctt taaacttctt tcattaataa ttcgtgctaa 720 attatgttaa tagaggggat aagtggacta attttctgta agcactaaat attctgaaat 780 actctgttaa ttacctttaa atggtataaa attagaatga aagaaccttt tctttccact 840 tttctagtta tctttttact attaagatgc agttttttat acttgtaatt gtagcggaat 900 gaacgttcat tccgtttttg aaaagaggtg ataaagtgga atctactcca acaaaacaaa 960 aagcgatttt ttctgcttcg cttctgctgt ttgcagaaag agggtttgat gcaaccacga 1020 tgccaatgat tgcagagaat gccaaagtag gagcaggaac aatttatcgc tactttaaaa 1080 ataaagaaag ccttgtaaat gaattattcc aacagcacgt aaacgagttt ttacagtgca 1140 ttgaaagcgg tctggcaaac gagagagatg gataccgaga tgggtttcat catatctttg 1200 aaggtatggt gacatttact aaaaaccatc ctcgtgctct tggatttatt aaaactcata 1260 gccaaggaac ttttttaaca gaagagagcc gcttagcata tcaaaagctg gtggaatttg 1320 tttgtacgtt cttcagagaa ggacaaaagc aaggtgtgat tagaaatctt cctgaaaatg 1380 cgctaattgc tattttattt ggaagtttca tggaagtata tgaaatgatt gaaaatgact 1440 acttatcttt aactgatgaa cttcttaccg gtgtagaaga gagtctgtgg gcagcactta 1500 gcagacaatc atgaaactta acaagtgaaa gagggataac atgacaatta aagaaatgcc 1560 tcagccaaaa acgtttggag agcttaaaaa tttaccgtta ttaaacacag ataaaccggt 1620 tcaagctttg atgaaaattg cggatgaatt aggagaaatc tttaaattcg aggcgcctgg 1680 tcgtgtaacg cgctacttat caagtcagcg tctaattaaa gaagcatgcg atgaatcacg 1740 ctttgataaa aacttaagtc aagcgcttaa atttgtacgt gattttgcag gagacgggtt 1800 atttacaagc tggacgcatg aaaaaaattg gaaaaaagcg cataatatct tacttccaag 1860 cttcagtcag caggcaatga aaggctatca tgcgatgatg gtcgatatcg ccgtgcagct 1920 tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt gaagtaccgg aagacatgac 1980 acgtttaacg cttgatacaa ttggtctttg cggctttaac tatcgcttta acagctttta 2040 ccgagatcag cctcatccat ttattacaag tatggtccgt gcactggatg aagcaatgaa 2100 caagctgcag cgagcaaatc cagacgaccc agcttatgat gaaaacaagc gccagtttca 2160 agaagatatc aaggtgatga acgacctagt agataaaatt attgcagatc gcaaagcaag 2220 cggtgaacaa agcgatgatt tattaacgca tatgctaaac ggaaaagatc cagaaacggg 2280 tgagccgctt gatgacgaga acattcgcta tcaaattatt acattcttaa ttgcgggaca 2340 cgaaacaaca agtggtcttt tatcatttgc gctgtatttc ttagtgaaaa atccacatgt 2400 attacaaaaa gcagcagaag aagcagcacg agttctagta gatcctgttc caagctacaa 2460 acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac gaagcgctgc gcttatggcc 2520 aactgctcct gcgttttccc tatatgcaaa agaagatacg gtgcttggag gagaatatcc 2580 tttagaaaaa ggcgacgaac taatggttct gattcctcag cttcaccgtg ataaaacaat 2640 ttggggagac gatgtggaag agttccgtcc agagcgtttt gaaaatccaa gtgcgattcc 2700 gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg tgtatcggtc agcagttcgc 2760 tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa cactttgact ttgaagatca 2820 tacaaactac gagctggata ttaaagaaac tttaacgtta aaacctgaag gctttgtggt 2880 aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct tcacctagca ctgaacagtc 2940 tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat acgccgctgc ttgtgctata 3000 cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat ttagcagata ttgcaatgag 3060 caaaggattt gcaccgcagg tcgcaacgct tgattcacac gccggaaatc ttccgcgcga 3120 aggagctgta ttaattgtaa cggcgtctta taacggtcat ccgcctgata acgcaaagca 3180 atttgtcgac tggttagacc aagcgtctgc tgatgaagta aaaggcgttc gctactccgt 3240 atttggatgc ggcgataaaa actgggctac tacgtatcaa aaagtgcctg cttttatcga 3300 tgaaacgctt gccgctaaag gggcagaaaa catcgctgac cgcggtgaag cagatgcaag 3360 cgacgacttt gaaggcacat atgaagaatg gcgtgaacat atgtggagtg acgtagcagc 3420 ctactttaac ctcgacattg aaaacagtga agataataaa tctactcttt cacttcaatt 3480 tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac ggtgcgtttt caacgaacgt 3540 cgtagcaagc aaagaacttc aacagccagg cagtgcacga agcacgcgac atcttgaaat 3600 tgaacttcca aaagaagctt cttatcaaga aggagatcat ttaggtgtta ttcctcgcaa 3660 ctatgaagga atagtaaacc gtgtaacagc aaggttcggc ctagatgcat cacagcaaat 3720 ccgtctggaa gcagaagaag aaaaattagc tcatttgcca ctcgctaaaa cagtatccgt 3780 agaagagctt ctgcaatacg tggagcttca agatcctgtt acgcgcacgc agcttcgcgc 3840 aatggctgct aaaacggtct gcccgccgca taaagtagag cttgaagcct tgcttgaaaa 3900 gcaagcctac aaagaacaag tgctggcaaa acgtttaaca atgcttgaac tgcttgaaaa 3960 atacccggcg tgtgaaatga aattcagcga atttatcgcc cttctgccaa gcatacgccc 4020 gcgctattac tcgatttctt catcacctcg tgtcgatgaa aaacaagcaa gcatcacggt 4080 cagcgttgtc tcaggagaag cgtggagcgg atatggagaa tataaaggaa ttgcgtcgaa 4140 ctatcttgcc gagctgcaag aaggagatac gattacgtgc tttatttcca caccgcagtc 4200 agaatttacg ctgccaaaag accctgaaac gccgcttatc atggtcggac cgggaacagg 4260 cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag ctaaaagaac aaggacagtc 4320 acttggagaa gcacatttat acttcggctg ccgttcacct catgaagact atctgtatca 4380 agaagagctt gaaaacgccc aaagcgaagg catcattacg cttcataccg ctttttctcg 4440 catgccaaat cagccgaaaa catacgttca gcacgtaatg gaacaagacg gcaagaaatt 4500 gattgaactt cttgatcaag gagcgcactt ctatatttgc ggagacggaa gccaaatggc 4560 acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac gttcaccaag tgagtgaagc 4620 agacgctcgc ttatggctgc agcagctaga agaaaaaggc cgatacgcaa aagacgtgtg 4680 ggctgggtaa attaaaaaga ggctaggata aaagtagttt agttggttga aggaagatcc 4740 gaacgatgaa tcgttcggat ctttttattg gtagagtaaa cgtagatttc atctatttag 4800 tgacttgtag cggttgattg gagggcaagg tgaagactcc aatcaaccgc ggtgtcacat 4860 gcaagccata cgaaattcat ttctcccatt tattcgtctt ttgtccccac ttaattttta 4920 tagcgcctta acgtttcttc tgcgtgacag cagatct 4957 <210> SEQ ID NO 2 <211> LENGTH: 1049 <212> TYPE: PRT <213> ORGANISM: Bacillus megaterium <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: GenBank / P14779 <309> DATABASE ENTRY DATE: 1990-04-01 <313> RELEVANT RESIDUES: (1)..(1049) <400> SEQUENCE: 2 Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys 1 5 10 15 Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys 20 25 30 Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg 35 40 45 Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp 50 55 60 Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg 65 70 75 80 Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 85 90 95 Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala 100 105 110 Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val 115 120 125 Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu 130 135 140 Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn 145 150 155 160 Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr 165 170 175 Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala 180 185 190 Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu 195 200 205 Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg 210 215 220 Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn 225 230 235 240 Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg 245 250 255 Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly 260 265 270 Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 275 280 285 Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 290 295 300 Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn 305 310 315 320 Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 325 330 335 Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 340 345 350 Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp 355 360 365 Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 370 375 380 Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala 385 390 395 400 Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 405 410 415 Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 420 425 430 Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 435 440 445 Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr 450 455 460 Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 465 470 475 480 Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 485 490 495 Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro 500 505 510 Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 515 520 525 Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 530 535 540 Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val 545 550 555 560 Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 565 570 575 Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala 580 585 590 Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 595 600 605 Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 610 615 620 Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys 625 630 635 640 Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu 645 650 655 Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 660 665 670 Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu 675 680 685 Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile 690 695 700 Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly 705 710 715 720 Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu 725 730 735 Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln 740 745 750 Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met 755 760 765 Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 770 775 780 Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr 785 790 795 800 Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 805 810 815 Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile 820 825 830 Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser 835 840 845 Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile 850 855 860 Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys 865 870 875 880 Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 885 890 895 Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 900 905 910 Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu 915 920 925 Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 930 935 940 Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr 945 950 955 960 Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val 965 970 975 Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp 980 985 990 Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro 995 1000 1005 Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln 1010 1015 1020 Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu 1025 1030 1035 Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 1040 1045 <210> SEQ ID NO 3 <211> LENGTH: 489 <212> TYPE: PRT <213> ORGANISM: Oryctolagus cuniculus <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: GenBank / P00182 <309> DATABASE ENTRY DATE: 1998-12-15 <313> RELEVANT RESIDUES: (1)..(489) <400> SEQUENCE: 3 Met Asp Leu Leu Ile Ile Leu Gly Ile Cys Leu Ser Cys Val Val Leu 1 5 10 15 Leu Ser Leu Trp Lys Lys Thr His Gly Lys Gly Lys Leu Pro Pro Gly 20 25 30 Pro Thr Pro Leu Pro Val Val Gly Asn Leu Leu Gln Leu Glu Thr Lys 35 40 45 Asp Ile Asn Lys Ser Leu Ser Met Leu Ala Lys Glu Tyr Gly Ser Ile 50 55 60 Phe Thr Leu Tyr Phe Gly Met Lys Pro Ala Val Val Leu Tyr Gly Tyr 65 70 75 80 Glu Gly Val Ile Glu Ala Leu Ile Asp Arg Gly Glu Glu Phe Ser Gly 85 90 95 Arg Gly Ile Phe Pro Val Phe Asp Arg Val Thr Lys Gly Leu Gly Ile 100 105 110 Val Phe Ser Ser Gly Glu Lys Trp Lys Glu Thr Arg Arg Phe Ser Leu 115 120 125 Thr Val Leu Arg Asn Leu Gly Met Gly Lys Lys Thr Ile Glu Glu Arg 130 135 140 Ile Gln Glu Glu Ala Leu Cys Leu Ile Gln Ala Leu Arg Lys Thr Asn 145 150 155 160 Ala Ser Pro Cys Asp Pro Thr Phe Leu Leu Phe Cys Val Pro Cys Asn 165 170 175 Val Ile Cys Ser Val Ile Phe Gln Asn Arg Phe Asp Tyr Asp Asp Glu 180 185 190 Lys Phe Lys Thr Leu Ile Lys Tyr Phe His Glu Asn Phe Glu Leu Leu 195 200 205 Gly Thr Pro Trp Ile Gln Leu Tyr Asn Ile Phe Pro Ile Leu His Tyr 210 215 220 Leu Pro Gly Ser His Arg Gln Leu Phe Lys Asn Ile Asp Gly Gln Ile 225 230 235 240 Lys Phe Ile Leu Glu Lys Val Gln Glu His Gln Glu Ser Leu Asp Ser 245 250 255 Asn Asn Pro Arg Asp Phe Val Asp His Phe Leu Ile Lys Met Glu Lys 260 265 270 Glu Lys His Lys Lys Gln Ser Glu Phe Thr Met Asp Asn Leu Ile Thr 275 280 285 Thr Ile Trp Asp Val Phe Ser Ala Gly Thr Asp Thr Thr Ser Asn Thr 290 295 300 Leu Lys Phe Ala Leu Leu Leu Leu Leu Lys His Pro Glu Ile Thr Ala 305 310 315 320 Lys Val Gln Glu Glu Ile Glu His Val Ile Gly Arg His Arg Ser Pro 325 330 335 Cys Met Gln Asp Arg Thr Arg Met Pro Tyr Thr Asp Ala Val Met His 340 345 350 Glu Ile Gln Arg Tyr Val Asp Leu Val Pro Thr Ser Leu Pro His Ala 355 360 365 Val Thr Gln Asp Ile Glu Phe Asn Gly Tyr Leu Ile Pro Lys Gly Thr 370 375 380 Asp Ile Ile Pro Ser Leu Thr Ser Val Leu Tyr Asp Asp Lys Glu Phe 385 390 395 400 Pro Asn Pro Glu Lys Phe Asp Pro Gly His Phe Leu Asp Glu Ser Gly 405 410 415 Asn Phe Lys Lys Ser Asp Tyr Phe Met Pro Phe Ser Ala Gly Lys Arg 420 425 430 Ala Cys Val Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Leu Leu 435 440 445 Thr Thr Ile Leu Gln His Phe Thr Leu Lys Pro Leu Val Asp Pro Lys 450 455 460 Asp Ile Asp Pro Thr Pro Val Glu Asn Gly Phe Val Ser Val Pro Pro 465 470 475 480 Ser Tyr Glu Leu Cys Phe Val Pro Val 485 <210> SEQ ID NO 4 <211> LENGTH: 490 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: GenBank / P11712 <309> DATABASE ENTRY DATE: 1989-10-01 <313> RELEVANT RESIDUES: (1)..(490) <400> SEQUENCE: 4 Met Asp Ser Leu Val Val Leu Val Leu Cys Leu Ser Cys Leu Leu Leu 1 5 10 15 Leu Ser Leu Trp Arg Gln Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly 20 25 30 Pro Thr Pro Leu Pro Val Ile Gly Asn Ile Leu Gln Ile Gly Ile Lys 35 40 45 Asp Ile Ser Lys Ser Leu Thr Asn Leu Ser Lys Val Tyr Gly Pro Val 50 55 60 Phe Thr Leu Tyr Phe Gly Leu Lys Pro Ile Val Val Leu His Gly Tyr 65 70 75 80 Glu Ala Val Lys Glu Ala Leu Ile Asp Leu Gly Glu Glu Phe Ser Gly 85 90 95 Arg Gly Ile Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly Ile 100 105 110 Val Phe Ser Asn Gly Lys Lys Trp Lys Glu Ile Arg Arg Phe Ser Leu 115 120 125 Met Thr Leu Arg Asn Phe Gly Met Gly Lys Arg Ser Ile Glu Asp Arg 130 135 140 Val Gln Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys 145 150 155 160 Ala Ser Pro Cys Asp Pro Thr Phe Ile Leu Gly Cys Ala Pro Cys Asn 165 170 175 Val Ile Cys Ser Ile Ile Phe His Lys Arg Phe Asp Tyr Lys Asp Gln 180 185 190 Gln Phe Leu Asn Leu Met Glu Lys Leu Asn Glu Asn Ile Lys Ile Leu 195 200 205 Ser Ser Pro Trp Ile Gln Ile Cys Asn Asn Phe Ser Pro Ile Ile Asp 210 215 220 Tyr Phe Pro Gly Thr His Asn Lys Leu Leu Lys Asn Val Ala Phe Met 225 230 235 240 Lys Ser Tyr Ile Leu Glu Lys Val Lys Glu His Gln Glu Ser Met Asp 245 250 255 Met Asn Asn Pro Gln Asp Phe Ile Asp Cys Phe Leu Met Lys Met Glu 260 265 270 Lys Glu Lys His Asn Gln Pro Ser Glu Phe Thr Ile Glu Ser Leu Glu 275 280 285 Asn Thr Ala Val Asp Leu Phe Gly Ala Gly Thr Glu Thr Thr Ser Thr 290 295 300 Thr Leu Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr 305 310 315 320 Ala Lys Val Gln Glu Glu Ile Glu Arg Val Ile Gly Arg Asn Arg Ser 325 330 335 Pro Cys Met Gln Asp Arg Ser His Met Pro Tyr Thr Asp Ala Val Val 340 345 350 His Glu Val Gln Arg Tyr Ile Asp Leu Leu Pro Thr Ser Leu Pro His 355 360 365 Ala Val Thr Cys Asp Ile Lys Phe Arg Asn Tyr Leu Ile Pro Lys Gly 370 375 380 Thr Thr Ile Leu Ile Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu 385 390 395 400 Phe Pro Asn Pro Glu Met Phe Asp Pro His His Phe Leu Asp Glu Gly 405 410 415 Gly Asn Phe Lys Lys Ser Lys Tyr Phe Met Pro Phe Ser Ala Gly Lys 420 425 430 Arg Ile Cys Val Gly Glu Ala Leu Ala Gly Met Glu Leu Phe Leu Phe 435 440 445 Leu Thr Ser Ile Leu Gln Asn Phe Asn Leu Lys Ser Leu Val Asp Pro 450 455 460 Lys Asn Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro 465 470 475 480 Pro Phe Tyr Gln Leu Cys Phe Ile Pro Val 485 490 <210> SEQ ID NO 5 <211> LENGTH: 504 <212> TYPE: PRT <213> ORGANISM: Rattus norvegicus <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: GenBank / P10633 <309> DATABASE ENTRY DATE: 1989-07-01 <313> RELEVANT RESIDUES: (1)..(504) <400> SEQUENCE: 5 Met Glu Leu Leu Asn Gly Thr Gly Leu Trp Ser Met Ala Ile Phe Thr 1 5 10 15 Val Ile Phe Ile Leu Leu Val Asp Leu Met His Arg Arg His Arg Trp 20 25 30 Thr Ser Arg Tyr Pro Pro Gly Pro Val Pro Trp Pro Val Leu Gly Asn 35 40 45 Leu Leu Gln Val Asp Leu Ser Asn Met Pro Tyr Ser Leu Tyr Lys Leu 50 55 60 Gln His Arg Tyr Gly Asp Val Phe Ser Leu Gln Lys Gly Trp Lys Pro 65 70 75 80 Met Val Ile Val Asn Arg Leu Lys Ala Val Gln Glu Val Leu Val Thr 85 90 95 His Gly Glu Asp Thr Ala Asp Arg Pro Pro Val Pro Ile Phe Lys Cys 100 105 110 Leu Gly Val Lys Pro Arg Ser Gln Gly Val Ile Leu Ala Ser Tyr Gly 115 120 125 Pro Glu Trp Arg Glu Gln Arg Arg Phe Ser Val Ser Thr Leu Arg Thr 130 135 140 Phe Gly Met Gly Lys Lys Ser Leu Glu Glu Trp Val Thr Lys Glu Ala 145 150 155 160 Gly His Leu Cys Asp Ala Phe Thr Ala Gln Ala Gly Gln Ser Ile Asn 165 170 175 Pro Lys Ala Met Leu Asn Lys Ala Leu Cys Asn Val Ile Ala Ser Leu 180 185 190 Ile Phe Ala Arg Arg Phe Glu Tyr Glu Asp Pro Tyr Leu Ile Arg Met 195 200 205 Val Lys Leu Val Glu Glu Ser Leu Thr Glu Val Ser Gly Phe Ile Pro 210 215 220 Glu Val Leu Asn Thr Phe Pro Ala Leu Leu Arg Ile Pro Gly Leu Ala 225 230 235 240 Asp Lys Val Phe Gln Gly Gln Lys Thr Phe Met Ala Leu Leu Asp Asn 245 250 255 Leu Leu Ala Glu Asn Arg Thr Thr Trp Asp Pro Ala Gln Pro Pro Arg 260 265 270 Asn Leu Thr Asp Ala Phe Leu Ala Glu Val Glu Lys Ala Lys Gly Asn 275 280 285 Pro Glu Ser Ser Phe Asn Asp Glu Asn Leu Arg Met Val Val Val Asp 290 295 300 Leu Phe Thr Ala Gly Met Val Thr Thr Ala Thr Thr Leu Thr Trp Ala 305 310 315 320 Leu Leu Leu Met Ile Leu Tyr Pro Asp Val Gln Arg Arg Val Gln Gln 325 330 335 Glu Ile Asp Glu Val Ile Gly Gln Val Arg Cys Pro Glu Met Thr Asp 340 345 350 Gln Ala His Met Pro Tyr Thr Asn Ala Val Ile His Glu Val Gln Arg 355 360 365 Phe Gly Asp Ile Ala Pro Leu Asn Leu Pro Arg Phe Thr Ser Cys Asp 370 375 380 Ile Glu Val Gln Asp Phe Val Ile Pro Lys Gly Thr Thr Leu Ile Ile 385 390 395 400 Asn Leu Ser Ser Val Leu Lys Asp Glu Thr Val Trp Glu Lys Pro His 405 410 415 Arg Phe His Pro Glu His Phe Leu Asp Ala Gln Gly Asn Phe Val Lys 420 425 430 His Glu Ala Phe Met Pro Phe Ser Ala Gly Arg Arg Ala Cys Leu Gly 435 440 445 Glu Pro Leu Ala Arg Met Glu Leu Phe Leu Phe Phe Thr Cys Leu Leu 450 455 460 Gln Arg Phe Ser Phe Ser Val Pro Val Gly Gln Pro Arg Pro Ser Thr 465 470 475 480 His Gly Phe Phe Ala Phe Pro Val Ala Pro Leu Pro Tyr Gln Leu Cys 485 490 495 Ala Val Val Arg Glu Gln Gly Leu 500 <210> SEQ ID NO 6 <211> LENGTH: 428 <212> TYPE: PRT <213> ORGANISM: Pseudomonas sp. <300> PUBLICATION INFORMATION: <308> DATABASE ACCESSION NUMBER: GenBank / P33006 <309> DATABASE ENTRY DATE: 1993-10-01 <313> RELEVANT RESIDUES: (1)..(428) <400> SEQUENCE: 6 Met Asp Ala Arg Ala Thr Ile Pro Glu His Ile Ala Arg Thr Val Ile 1 5 10 15 Leu Pro Gln Gly Tyr Ala Asp Asp Glu Val Ile Tyr Pro Ala Phe Lys 20 25 30 Trp Leu Arg Asp Glu Gln Pro Leu Ala Met Ala His Ile Glu Gly Tyr 35 40 45 Asp Pro Met Trp Ile Ala Thr Lys His Ala Asp Val Met Gln Ile Gly 50 55 60 Lys Gln Pro Gly Leu Phe Ser Asn Ala Glu Gly Ser Glu Ile Leu Tyr 65 70 75 80 Asp Gln Asn Asn Glu Ala Phe Met Arg Ser Ile Ser Gly Gly Cys Pro 85 90 95 His Val Ile Asp Ser Leu Thr Ser Met Asp Pro Pro Thr His Thr Ala 100 105 110 Tyr Arg Gly Leu Thr Leu Asn Trp Phe Gln Pro Ala Ser Ile Arg Lys 115 120 125 Leu Glu Glu Asn Ile Arg Arg Ile Ala Gln Ala Ser Val Gln Arg Leu 130 135 140 Leu Asp Phe Asp Gly Glu Cys Asp Phe Met Thr Asp Cys Ala Leu Tyr 145 150 155 160 Tyr Pro Leu His Val Val Met Thr Ala Leu Gly Val Pro Glu Asp Asp 165 170 175 Glu Pro Leu Met Leu Lys Leu Thr Gln Asp Phe Phe Gly Val His Glu 180 185 190 Pro Asp Glu Gln Ala Val Ala Ala Pro Arg Gln Ser Ala Asp Glu Ala 195 200 205 Ala Arg Arg Phe His Glu Thr Ile Ala Thr Phe Tyr Asp Tyr Phe Asn 210 215 220 Gly Phe Thr Val Asp Arg Arg Ser Cys Pro Lys Asp Asp Val Met Ser 225 230 235 240 Leu Leu Ala Asn Ser Lys Leu Asp Gly Asn Tyr Ile Asp Asp Lys Tyr 245 250 255 Ile Asn Ala Tyr Tyr Val Ala Ile Ala Thr Ala Gly His Asp Thr Thr 260 265 270 Ser Ser Ser Ser Gly Gly Ala Ile Ile Gly Leu Ser Arg Asn Pro Glu 275 280 285 Gln Leu Ala Leu Ala Lys Ser Asp Pro Ala Leu Ile Pro Arg Leu Val 290 295 300 Asp Glu Ala Val Arg Trp Thr Ala Pro Val Lys Ser Phe Met Arg Thr 305 310 315 320 Ala Leu Ala Asp Thr Glu Val Arg Gly Gln Asn Ile Lys Arg Gly Asp 325 330 335 Arg Ile Met Leu Ser Tyr Pro Ser Ala Asn Arg Asp Glu Glu Val Phe 340 345 350 Ser Asn Pro Asp Glu Phe Asp Ile Thr Arg Phe Pro Asn Arg His Leu 355 360 365 Gly Phe Gly Trp Gly Ala His Met Cys Leu Gly Gln His Leu Ala Lys 370 375 380 Leu Glu Met Lys Ile Phe Phe Glu Glu Leu Leu Pro Lys Leu Lys Ser 385 390 395 400 Val Glu Leu Ser Gly Pro Pro Arg Leu Val Ala Thr Asn Phe Val Gly 405 410 415 Gly Pro Lys Asn Val Pro Ile Arg Phe Thr Lys Ala 420 425 <210> SEQ ID NO 7 <211> LENGTH: 51 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: oligonucleotide <400> SEQUENCE: 7 cgcggatcca tcgatgctta ggaggtcata tgacaattaa agaaatgcct c 51 <210> SEQ ID NO 8 <211> LENGTH: 51 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: oligonucleotide <400> SEQUENCE: 8 ccggaattct taatgatgat gatgatgatg cccagcccac acgtcttttg c 51 <210> SEQ ID NO 9 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer <400> SEQUENCE: 9 catacaaact acgagctcga tattaaagaa ac 32 <210> SEQ ID NO 10 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer <400> SEQUENCE: 10 gtttctttaa tatcgagctc gtagtttgta tg 32 <210> SEQ ID NO 11 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer <400> SEQUENCE: 11 acaggatcca tcgatgctta ggaggtcata tg 32 <210> SEQ ID NO 12 <211> LENGTH: 19 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer <400> SEQUENCE: 12 gtgaaggaat accgccaag 19 <210> SEQ ID NO 13 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pTBamHd <400> SEQUENCE: 13 gaaccggatc catgacaatt aaagaaatgc 30 <210> SEQ ID NO 14 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer Rev3250 <400> SEQUENCE: 14 ctattctcac tccgctgaaa ctgttg 26 <210> SEQ ID NO 15 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pT235_F <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (1)..(32) <223> OTHER INFORMATION: where n may be a or g or c or t/u, unknown, or other <400> SEQUENCE: 15 gcgatgattt attannncat atgctaaacg ga 32 <210> SEQ ID NO 16 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pT235_R <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (1)..(32) <223> OTHER INFORMATION: where n may be a or g or c or t/u, unknown, or other <400> SEQUENCE: 16 tagc atatgnnnta ataaatcatc gc 32 <210> SEQ ID NO 17 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pT471_F <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (1)..(38) <223> OTHER INFORMATION: where n may be a or g or c or t/u, unknown, or other <400> SEQUENCE: 17 cagtctgcta aaaaagtann naaaaaggca gaaaacgc 38 <210> SEQ ID NO 18 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pT471_R <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (1)..(38) <223> OTHER INFORMATION: where n may be a or g or c or t/u, unknown, or other <400> SEQUENCE: 18 gcgttttctg cctttttnnn tactttttta gcagactg 38 <210> SEQ ID NO 19 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pT1024_F <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (1)..(33) <223> OTHER INFORMATION: where n may be a or g or c or t/u, unknown, or other <400> SEQUENCE: 19 gacgttcacc aagtgnnnga agcagacgct cgc 33 <210> SEQ ID NO 20 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer pT3074_R <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: (1)..(33) <223> OTHER INFORMATION: where n may be a or g or c or t/u, unknown, or other <400> SEQUENCE: 20 gcgagcgtct gcttcnnnca cttggtgaac gtc 33 <210> SEQ ID NO 21 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer A87F1 <400> SEQUENCE: 21 gcaggagacg ggttatttac aagctggacg 30 <210> SEQ ID NO 22 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer A87F2 <400> SEQUENCE: 22 cgtccagctt gtaaataacc cgtctcctgc 30 <210> SEQ ID NO 23 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Gly1 <400> SEQUENCE: 23 gcaggagacg ggttaggtca agctggacg 29 <210> SEQ ID NO 24 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Gly2 <400> SEQUENCE: 24 cgtccagctt gtacctaacc cgtctcctgc 30 <210> SEQ ID NO 25 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Trp1 <400> SEQUENCE: 25 gcaggagacg ggttatggac aagctggacg 30 <210> SEQ ID NO 26 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Trp2 <400> SEQUENCE: 26 cgtccagctt gtccataacc cgtctcctgc 30 <210> SEQ ID NO 27 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87His1 <400> SEQUENCE: 27 gcaggagacg ggttacacac aagctggacg 30 <210> SEQ ID NO 28 <211> LENGTH: 31 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87His2 <400> SEQUENCE: 28 cgtccagctt gtgtgtaacc cgtcctcctg c 31 <210> SEQ ID NO 29 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Asn1 <400> SEQUENCE: 29 gcaggagacg ggttaaacac aagctggacg 30 <210> SEQ ID NO 30 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Asn2 <400> SEQUENCE: 30 cgtccagctt gtgtttaacc cgtctcctgc 30 <210> SEQ ID NO 31 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Asp1 <400> SEQUENCE: 31 gcaggagacg ggttagatac aagctggacg 30 <210> SEQ ID NO 32 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Asp2 <400> SEQUENCE: 32 cgtccagctg tatctaaccc gtctcctgc 29 <210> SEQ ID NO 33 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Arg1 <400> SEQUENCE: 33 gcaggagacg ggttacgtac aagctggacg 30 <210> SEQ ID NO 34 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Arg2 <400> SEQUENCE: 34 cgtccagctt gtacgtaacc cgtctcctgc 30 <210> SEQ ID NO 35 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Val1 <400> SEQUENCE: 35 gcaggagacg ggttagttac aagctggacg 30 <210> SEQ ID NO 36 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Val2 <400> SEQUENCE: 36 cgtccagctt gtaactaacc cgtctcctgc 30 <210> SEQ ID NO 37 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Ile1 <400> SEQUENCE: 37 gcaggagacg ggttaattac aagctggacg 30 <210> SEQ ID NO 38 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Ile2 <400> SEQUENCE: 38 cgtccagctt gtaattaacc cgtctcctgc 30 <210> SEQ ID NO 39 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Lys1 <400> SEQUENCE: 39 gcaggagacg ggttaaaaac aagctggacg 30 <210> SEQ ID NO 40 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer F87Lys2 <400> SEQUENCE: 40 cgtccagctt gtttttaacc cgtctcctgc 30 <210> SEQ ID NO 41 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer Bamfor <400> SEQUENCE: 41 acaggatcca tcgatgctta ggaggtcata tg 32 <210> SEQ ID NO 42 <211> LENGTH: 19 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: PCR primer Sacrev <400> SEQUENCE: 42 gtgaaggaat accgccaag 19 

What is claimed is:
 1. An isolated nucleic acid encoding a cytochrome P450 variant which has a higher capability than the corresponding wild-type cytochrome P450 to oxidize at least one substrate selected from an alkane comprising a carbon-chain of no more than 8 carbons and an alkene comprising a carbon-chain no more than 8 carbons, wherein the wild-type cytochrome P450 comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO: 2, and the cytochrome P450 variant comprises an amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from V78, H236, and E252.
 2. The isolated nucleic acid of claim 1, wherein the amino acid sequence has at least 80% sequence identity to SEQ ID NO:
 2. 3. The isolated nucleic acid of claim 2, wherein the amino acid sequence is SEQ ID NO:
 2. 4. The isolated nucleic acid of claim 1, wherein the higher capability is a higher maximum turnover rate of the substrate into an an oxidized product.
 5. The isolated nucleic acid of claim 4, wherein the maximum turnover rate of the variant is at least 5 times the maximum turnover rate of the wild-type.
 6. The isolated nucleic acid of claim 5, wherein the maximum turnover rate of the variant is at least 10 times the maximum turnover rate of the wild-type.
 7. The isolated nucleic acid of claim 5, wherein the substrate is an alkane and the capability to oxidize is capability to hydroxylate.
 8. The isolated nucleic acid of claim 7, wherein the alkane is selected from octane, hexane, cyclohexane, pentane, butane, and propane.
 9. The isolated nucleic acid of claim 8, wherein the cytochrome P450 BM-3 variant has a higher capability than wild-type cytochrome P450 BM-3 to oxidize octane, hexane, cyclohexane, pentane, butane, and propane.
 10. The isolated nucleic acid of claim 5, wherein the substrate is an alkene and the capability to oxidize is the capability to epoxidate or hydroxylate.
 11. The isolated nucleic acid of claim 10, wherein the alkene is selected from 1-hexene, 2-hexene, 3-hexene, cyclohexene, isoprene, allyl chloride, propene, and styrene.
 12. The isolated nucleic acid of claim 11, wherein the cytochrome P450 BM-3 variant has a higher capability than wild-type cytochrome P450 BM-3 to oxidize 1-hexene, 2-hexene, 3-hexene, cyclohexene, isoprene, allyl chloride, propene, and styrene.
 13. The isolated nucleic acid of claim 1, wherein the cytochrome P450 variant comprises amino acid substitutions at residues corresponding to at least two residues of SEQ ID NO: 2 selected from V78, H236, and E252.
 14. The isolated nucleic acid of claim 1, wherein the cytochrome P450 variant comprises amino acid substitutions at residues corresponding to V78, H236, and E252 of SEQ ID NO:
 2. 15. The isolated nucleic acid of claim 1, wherein the amino acid substitution is selected from V78A, H236Q, and E252G.
 16. The isolated nucleic acid of claim 3, wherein the cytochrome P450 variant comprises amino acid substitutions at two residues selected from V78, H236, and E252.
 17. The isolated nucleic acid of claim 16, wherein the cytochrome P450 variant comprises amino acid substitutions at V78, H236, and E252.
 18. The isolated nucleic acid of claim 17, wherein the amino acid substitutions are V78A, H236Q, and E252G.
 19. The isolated nucleic acid of claim 17, wherein the cytochrome P450 variant comprises at least one further amino acid substitution at a residue selected from H138, T175, V178, A184, N186, F205, D217, S226, R255, A290, A295, L353, and G396.
 20. The isolated nucleic acid of claim 19, wherein the at least one further amino acid substitution is selected from H138Y, T175I, V178I, A184V, N186D, F205C, D217V, S226I, S226R, R255S, A290V, A295T, L353V, and G396M.
 21. The isolated nucleic acid of claim 20, wherein the cytochrome P450 variant comprises the amino acid substitutions V78A, H138Y, T175I, V1781, A184V, H236Q, E252G, R255S, A290V, A295T, and L353V.
 22. The isolated nucleic acid of claim 20, wherein the cytochrome P450 variant comprises the amino acid substitutions V78A, T175I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, and L353V.
 23. An isolated nucleic acid encoding a cytochrome P450 variant which has a higher organic solvent-resistance than the corresponding wild-type cytochrome P450, wherein the wild-type cytochrome P450 comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO: 2, and the cytochrome P450 variant comprises an amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from T235, R471, E494, and S1024.
 24. The isolated nucleic acid of claim 23, wherein the amino acid sequence has at least 80% sequence identity to SEQ ID NO:
 2. 25. The isolated nucleic acid of claim 24, wherein the amino acid sequence is SEQ ID NO:
 2. 26. The isolated nucleic acid of claim 23, wherein the organic solvent-resistance is higher maximum turnover rate of a substrate into an an oxidized product in a solution comprising an organic solvent.
 27. The isolated nucleic acid of claim 26, wherein the oxidized product is hydroxylated.
 28. The isolated nucleic acid of claim 26, wherein the organic solvent is selected from THF, DMSO, acetone, acetonitrile, methanol, ethanol, propanol, dioxane, and dimethylformamide (DMF).
 29. The isolated nucleic acid of claim 28, wherein the solution comprises 25% (v/v) DMSO, and the hydroxylation activity of the cytochrome P450 BM-3 variant is at least twice the hydroxylation activity of wild-type cytochrome P450 BM-3.
 30. The isolated nucleic acid of claim 28, wherein the solution comprises 2% (v/v) THF, and the hydroxylation activity of the cytochrome P450 BM-3 variant is at least twice the hydroxylation activity of wild-type cytochrome P450 BM-3.
 31. The isolated nucleic acid of claim 23, wherein the cytochrome P450 variant comprises amino acid substitutions at residues corresponding to at least two residues of SEQ ID NO: 2 selected from T235, R471, E494, and S1024.
 32. The isolated nucleic acid of claim 31, wherein the cytochrome P450 variant comprises amino acid substitutions at residues corresponding to at least three residues of SEQ ID NO: 2 selected from T235, R471, E494, and S1024.
 33. The isolated nucleic acid of claim 32, wherein the cytochrome P450 variant comprises amino acid substitutions at residues corresponding to T235, R471, E494, and S1024 of SEQ ID NO:
 2. 34. The isolated nucleic acid of claim 25, wherein the cytochrome P450 variant comprises amino acid substitutions at at least two residues selected from T235, R471, E494, and S1024.
 35. The isolated nucleic acid of claim 34, wherein the cytochrome P450 variant comprises amino acid substitutions at at least three residues selected from T235, R471, E494, and S
 1024. 36. The isolated nucleic acid of claim 35, wherein the cytochrome P450 variant comprises amino acid substitutions at T235, R471, E494, and S1024.
 37. The isolated nucleic acid of claim 36, wherein the amino acid substitutions are T235A, R471A, E494K, and S1024E.
 38. The isolated nucleic acid of claim 36, wherein the cytochrome P450 variant further comprises an amino acid substitution at residue F87.
 39. The isolated nucleic acid of claim 38, wherein the amino acid substitution is F87A.
 40. The isolated nucleic acid of claim 37, wherein the cytochrome P450 variant comprises the mutations F87A, T235A, R471A, E494K, and S1024E
 41. An isolated nucleic acid encoding a variant of a parent cytochrome P450 oxygenase, the variant having (i) a higher ability than the parent to oxidize a substrate selected from an alkane comprising a carbon-chain of no more than 8 carbons or alkene comprising a carbon-chain of less than 8 carbons; and (ii) at least one amino acid substitution in a secondary structural element of the cytochrome P450 heme domain selected from the helix B′ domain, the helix H domain, and the helix I domain, wherein the parent comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO:
 2. 42. The isolated nucleic acid of claim 41, wherein the amino acid sequence has at least 80% sequence identity to SEQ ID NO:
 2. 43. The isolated nucleic acid of claim 42, wherein the amino acid sequence is SEQ ID NO:
 2. 44. The isolated nucleic acid of claim 41, wherein the amino acid substitution is at a residue corresponding to a residue of SEQ ID NO: 2 selected from V78, H236, and E252.
 45. The isolated nucleic acid of claim 44, wherein the amino acid substitution is selected from V78A, H236Q, and E252G.
 46. The isolated nucleic acid of claim 41, further comprising at least one amino acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from H138, T175, V178, A184, F205, S226, R255, A290, A295, and L353.
 47. The isolated nucleic acid of claim 46, wherein the amino acid substitution is selected from V78A, H138Y, T175I, V178I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, A295T, and L353V.
 48. The isolated nucleic acid of claim 44, wherein the variant comprises amino acid substitutions at residues corresponding to amino acid residues V78, H236, and E252 of SEQ ID NO:
 2. 49. The isolated nucleic acid of claim 48, wherein the amino acid substitutions are V78A, H236Q, and E252G.
 50. The isolated nucleic acid of claim 43, wherein the variant comprises the amino acid substitutions V78A, H138Y, T175I, V178I, A184V, H236Q, E252G, R255S, A290V, A295T, and L353V.
 51. The isolated nucleic acid of claim 43, wherein the variant comprises the amino acid substitutions V78A, T175I, A184V, F205C, S226R, H236Q, E252G, R255S, A290V, and L353V.
 52. An isolated nucleic acid encoding a variant of a parent cytochrome P450 oxygenase, the variant having: (i) a higher organic solvent resistance than the parent; and (ii) at least one amino acid substitution in a secondary structural element of the selected from the helix H domain of the cytochrome P450 heme domain and the helix of the flavin domain; wherein the parent comprises an amino acid sequence having at least 60% sequence identity to SEQ ID NO:
 2. 53. The isolated nucleic acid of claim 52, wherein the amino acid sequence has at least 80% sequence identity to SEQ ID NO:
 2. 54. The isolated nucleic acid of claim 53, wherein the amino acid sequence is SEQ ID NO:
 2. 55. The isolated nucleic acid of claim 52, wherein the amino acid substitution is at a residue corresponding to a residue of SEQ ID NO: 2 selected from T235 and E494, and the cytochrome P450 variant comprises a further amino, acid substitution at a residue corresponding to a residue of SEQ ID NO: 2 selected from R471 and S1024.
 56. The isolated nucleic acid of claim 55, wherein the amino acid substitution is selected from T235A and E494K, and the further amino acid substitution is selected from R471A and S1024E.
 57. The isolated nucleic acid of claim 54, wherein the amino acid substitution is at a residue selected from T235 and E494, and the cytochrome P450 variant comprises a further amino acid substitution at a residue selected from R471 and S1024.
 58. The isolated nucleic acid of claim 57, wherein the amino acid substitution is selected from T235A and E494K, and the further amino acid substitution is selected from R471A and S1024E.
 59. The isolated nucleic acid of claim 52, further comprising an amino acid substitution at a residue corresponding to residue F87 of SEQ ID NO:
 2. 60. The isolated nucleic acid of claim 59, wherein the amino acid substitution is selected from F87A, F87G, F87V, F87I, F87F, F87W, F87D, F87N, F87H, F87K, and F87R.
 61. The isolated nucleic acid of claim 54, wherein the variant comprises amino acid substitutions at residues corresponding to amino acid residues T235, R471, E494, and S1024 of SEQ ID NO:
 2. 62. The isolated nucleic acid of claim 61, wherein the amino acid substitutions are T235A, R471A, E494K, and S1024E.
 63. The isolated nucleic acid of claim 62, further comprising the amino acid substitution F87A.
 64. An isolated nucleic acid encoding a cytochrome P450 variant, the cytochrome P450 variant comprising the amino acid substitutions V78A, H236Q, and E252G of SEQ ID NO:
 2. 65. The isolated nucleic acid of claim 64, further encoding the amino acid substitutions H138Y, T175I, V178I, A184V, N186D, D217V, S2261, R255S, A290V, A295T, L353V, and G396M, or the amino acid substitutions T175I, A184V, F205C, S226R, R255S, A290V, L353V.
 66. An isolated nucleic acid encoding a cytochrome P450 variant, the cytochrome P450 variant comprising the amino acid substitutions T235A, R471A, E494K, and S1024E of SEQ ID NO:
 2. 67. The isolated nucleic acid of claim 66, further encoding the amino acid substitution F87A. 